ALTERED LINOLENIC AND LINOLEIC ACID CONTENT IN PLANTS
This is a continuation-in-part of U.S. Serial No. 08/156,551
filed November 22, 1993, which is a continuation of U.S. Serial No.
08/014,431, filed on February 5, 1993. The present invention relates to
genetically engineered plants. In particular, it relates to genetically
engineered plants and seeds which have altered linolenic and linoleic acid
content compared with naturally occurring plants.

BACKGROUND

Many crop species produce seed oils in which the fatty acid
composition is not ideally suited to the intended use. The application of
conventional breeding methods, coupled in some cases with mutagenesis,
has resulted in the production of new varieties of several species with
desirable alterations in the fatty acid composition of seed oil. A notable
example is the development of low erucic acid varieties of rapeseed
(Stefansson 1983). Similar efforts have resulted in the reduction of the
level of polyunsaturated 18-carbon fatty acids in soybean (Wilcox and
Cavins 1985; Graef et al. 1988), sunflower (Fick 1989), and linseed oils
(Green and Marshal 1984).

Most of the genetic variation in seed lipid fatty acid
composition appears to involve the presence of an allele of a gene that
disrupts normal fatty acid metabolism and leads to an accumulation of
intermediate fatty acid products in the seed storage lipids (Downey 1987).
However, it seems likely that, because of the inherent limitations of this
approach, many other desirable changes in seed oil fatty acid composition
may require the directed application of genetic engineering methods.

α-Linolenic acid (18:3Δ9,12,15) is an eighteen carbon fatty acid
containing three cis double bonds at the 9-10, 12-13 and 15-16 carbons. It
is found in the cells of higher plants as a constituent of cell membranes. It
is also found in storage organs, such as seeds. There it is located in
structures designated oil bodies, which are bounded by an electron dense
structure that is thought to be a half-unit membrane and are dispersed in
the cytoplasmic environment of cells. When present as a constituent of cell
membranes, linolenic acid is usually esterified to the sn-1 or sn-2 position
of the glycerol moiety of a diacyl-glycerolipid. By contrast, when present in
oil bodies, linolenic acid is usually esterified to the sn-1, sn-2 or sn-3
position of a triacylglycerolipid (TAG).

Linolenic acid is extensively used in the paint and varnish
industry in view of its rapid oxidation. Flax seed is a predominant source of
this oil. Soybean seed, on the other hand, does not have sufficient linolenic
acid content to be used in this industry. Thus, increasing the linolenic acid
content in a plant such as soybean would permit the use of the soybean oil
in the paint and varnish industry.

On the other hand, it is undesirable to have significant levels
of linolenic acid in cooking oils and foods. Linolenic acid is unstable during
cooking and is rapidly oxidized. The oxidized products impart rancidity to
the finished product. A rapeseed or soybean oil with reduced linolenic acid,
such as one containing 2% or less of linolenic acid, would be ideal for use as
a cooking oil.

Linolenic acid is also a precursor in the biosynthesis of
jasmonic acid, an important plant growth regulator. Linolenic acid is
converted to jasmonic acid by introduction of an oxygen to the carbon chain
by a lipoxygenase, followed by dehydration, reduction, and several
β-oxidations (Vick and Zimmerman, 1984). The activity of jasmonic acid has
been measured in terms of induction of pathogen defense responses. By
application of free linolenic acid to plants, plant pathogen defenses can also
be induced (Farmer and Ryan, 1992).

A model has been proposed to explain the ability of free
linolenic acid to exhibit the effects associated with jasmonic acid (Farmer
and Ryan, 1992). It is hypothesized that all of the enzymatic activities
which are required for the conversion of linolenic acid to jasmonic acid are
constitutively present in the cell and the rate limiting step in the production
of jasmonic acid is the availability of free linolenic acid. A likely route for
the production of the free linolenic acid is by the activity of a lipase in the
plasma membrane.

It has been observed that exogenous jasmonic acid can more
powerfully activate defense responses than can wounding. This suggests
that wounds cannot generate enough free linolenic acid to support high level
production of jasmonic acid. The activity of the lipase or the availability of
appropriate substrate for the lipase may be rate limiting upon wounding.
Thus, increasing the linolenic acid content of plasma membrane may
positively influence "signal transduction" in plants and result in better
protection against environmental and pathogen stress.

Linolenic, oleic and linoleic acids are also important
constituents, as well as precursors of volatile carbonyl compounds, which
contribute to the aroma of both fresh and cooked foods. The major fatty
acids of tomato fruit pericarp are oleic, linoleic and linolenic acids. As the
fruit ripens, the levels of the latter two fatty acids decline, resulting in the
production of a number of 4-6 carbon containing aldehydes and ketones.
One particular metabolite, cis-3-hexenol, has been shown to be present in
higher levels in vine-ripened tomatoes compared to supermarket tomatoes
or tomatoes stored in refrigerators. It is likely, therefore, that the "aroma"
of fresh fruits and vegetables can be "modulated" by regulation of the
content of linolenic and linoleic acids, important substrates for the enzyme
lipoxygenase and subsequently the hydroperoxide cleaving enzyme, which
generates the volatile "aroma" compounds.

From the above, it is clear that the ability to vary the content
of linolenic acid in plants would be desirable. However, to achieve this
result it is necessary to determine what controls the production of linolenic
acid in plants.

A large body of experimental evidence derived from
radiochemical tracer studies has indicated that α-linolenic acid is
synthesized by the desaturation of linoleic acid (18:2Δ9,12) (reviewed in
Harwood 1988). However, the actual substrate for desaturation is not
known.

In vivo and in vitro labelling studies suggest that there are
possibly two distinct pathways for the synthesis of linolenic acid (Browse
and Somerville, 1991). One possible pathway is thought to be located in the
endoplasmic reticulum where linoleic acid esterified to the sn-2 position of
phosphatidylcholine is a substrate for desaturation. However, the
available evidence does not exclude the possibility that linoleic acid
esterified to other lipids may also be a substrate.

A second possible pathway of linoleic acid desaturation is
located in the plastid where the available evidence suggests that linoleic
acid esterified to monogalactosyldiacylglycerol and, possibly, other plastid
lipids is the substrate for desaturation.

Relatively little direct information is available concerning the
enzymes involved in linoleic acid desaturation. Low levels of enzyme
activity have been detected in microsomal membrane preparations from
developing linseed (Linum usitatissimum) (Browse and Slack, 1981) and, more
recently, in preparations of gently lysed chloroplasts (Schmidt and Heinz,
1990a,b). The general features of the enzyme may be inferred from
information available about other enzymes of this class.

The most thoroughly characterized desaturase is the stearoyl-
Coenzyme A (CoA) desaturase from vertebrate liver (reviewed by
Holloway, 1983). This enzyme has been shown to be an integral membrane
protein which contains non-heme iron. The desaturase reaction requires
fatty acyl-CoA, molecular oxygen and reduced cytochrome b5, another
membrane protein. In vivo, the reduced cytochrome b5 is produced by the
transfer of reducing equivalents from NADH via the activity of
cytochrome b5 reductase, a flavin containing membrane protein.

The most thoroughly characterized desaturase from plants is
the stearoyl-ACP desaturase (McKeon and Stumpf, 1982; Shanklin and
Somerville, 1991). This enzyme also requires molecular oxygen and a high
potential reductant. However, in contrast to the animal enzyme, this
desaturase is a soluble plastid protein which preferentially acts on a fatty
acid esterified to acyl carrier protein (ACP) rather than CoA. This enzyme
also differs from the animal enzyme by utilizing reduced ferredoxin as an
intermediate electron donor.

Other plant desaturases appear to be membrane proteins.
The microsomal Δ12 oleate desaturase has been assayed in membrane
preparations from several plant species (Harwood, 1988).
As with the stearoyl-CoA desaturase from animals, this enzyme requires
molecular oxygen and reduced cytochrome b5 as an electron donor (Kearns
et al., 1991). However, it appears that oleate esterified to a phospholipid is
the substrate rather than a CoA ester.

With regard to the activity responsible for the making of
linolenic acid, little was known as to its source or origin. However, evidence
that the amount of linolenic acid is related to the amount of linoleic acid
desaturase activity has been obtained by analysis of the properties of the
fad3 mutant of Arabidopsis thaliana (Lemieux et al. 1990). This mutant is
deficient in linolenic acid in the storage oils of its seed lipids and in the
membrane lipids of different tissues to varying degrees. The mutant also
had an increase in the amount of linoleic acid. This can be interpreted as
evidence that the mutant is defective in the activity of a desaturase which
converts linoleic acid to linolenic acid.

There is further evidence to suggest that the activity of this
desaturase could be rate limiting for linolenic acid synthesis under normal
circumstances. This was discovered by measuring the effects on fatty acid
composition in heterozygous plants (i.e., fad3+/fad3-) formed by crossing the
wild type with the fad3 mutant. In these F1 plants, which have one copy of
the normal fad3 gene product instead of the two normally found in the wild
type, the amount of linolenic acid was almost exactly intermediate between
that found in either parent. This suggests that the amount of linolenic acid
is proportional to the amount of functional fad3 gene product (Lemieux et
al., 1990).
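
The dosage argument can be made concrete with a minimal sketch. It assumes a strictly linear relationship between the number of functional fad3 copies (0, 1 or 2) and linolenic acid content; the function name and the percentages are hypothetical placeholders and are not data from Lemieux et al.

```python
def expected_linolenic_pct(functional_copies, wild_type_pct, mutant_pct):
    """Linear gene-dosage model (an assumption, not a measured relationship).

    functional_copies: number of functional fad3 alleles (0, 1 or 2).
    wild_type_pct:     linolenic acid content with two functional copies.
    mutant_pct:        linolenic acid content with no functional copy.
    """
    if functional_copies not in (0, 1, 2):
        raise ValueError("a diploid plant carries 0, 1 or 2 functional copies")
    per_copy = (wild_type_pct - mutant_pct) / 2.0
    return mutant_pct + per_copy * functional_copies


# Hypothetical parental values, chosen only to illustrate the arithmetic.
wild_type, mutant = 20.0, 2.0
heterozygote = expected_linolenic_pct(1, wild_type, mutant)
print(heterozygote)  # midpoint of the two parental values under this model
```

Under this model the heterozygote falls exactly midway between the parents, which is the pattern the F1 observation is taken to support.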

These results do not shed any light, however, on the nature of
the fad3 gene product or whether the observed effects in mutants are
related to either a decrease in quantity of desaturase protein or desaturase
activity due to a defective protein.

Moreover, nothing is known with any degree of certainty
about the linoleic acid desaturase from plant microsomes. As noted above,
very little is known about the microsomal desaturases except that they
probably utilize reduced cytochrome b5 as intermediate electron donor and
probably utilize lipids rather than CoA or ACP esters as substrates.

Moreover, as in many other aspects of plant biology, the lack
of specific information about the biochemistry and regulation of lipid
metabolism makes it difficult to predict how the introduction of one or a few
genes might usefully alter seed lipid synthesis.

An additional problem arises from the fact that many of the
key enzymes of lipid metabolism are membrane-bound and present in low
quantities. Thus, attempts to solubilize and purify them from plant
sources have not been successful.

SUMMARY OF THE INVENTION

The present invention provides structural coding sequences
encoding linoleic acid desaturase activity which can be used to alter the
linoleic and linolenic acid compositions of plants or to isolate other plant
linoleic acid desaturases. The present invention further provides a plant
capable of expressing a structural coding sequence to control the level of
linolenic acid or linoleic acid or both in the plant. The present invention
further provides a method for controlling the levels of linoleic and linolenic
acid in plants. It is also demonstrated by the present invention that the
linoleic acid desaturase enzyme activity in plant cells and tissues is a
controlling step in linolenic acid biosynthesis.

The present invention further relates to the engineering of two
advantageous traits into plants: increased and decreased α-linolenic acid
content in the structural lipids or storage oils of various crop plants.

In accomplishing the foregoing, there is provided, in
accordance with one aspect of the present invention, a genetically
transformed plant which has an elevated linolenic acid content comprising
a recombinant, double-stranded DNA molecule comprising

(i) a promoter that functions in plant cells to cause
the production of an RNA sequence, said promoter
operably linked to;

(ii) a structural coding sequence that causes the
production of an RNA sequence that encodes a linoleic
acid desaturase activity; and

(iii) a 3' non-translated region that functions in plant
cells to promote polyadenylation to the 3' end of said RNA
sequence.

In accordance with another aspect of the present invention,
there is provided a genetically transformed plant which has a reduced
linolenic acid content, comprising a recombinant, double-stranded DNA
molecule comprising

(i) a promoter that functions in plant cells to cause
the production of an RNA sequence, said promoter
operably linked to;

(ii) a DNA sequence that causes the production of an
RNA sequence that is in antisense orientation to at least
a portion of a gene that encodes a linoleic acid desaturase
activity in said plant; and

(iii) a 3' non-translated region that functions in plant
cells to promote polyadenylation to the 3' end of said RNA
sequence.
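
At the sequence level, "antisense orientation" means that the transcribed RNA is the reverse complement of the corresponding stretch of the target mRNA. The following is a minimal sketch of that relationship only. The 18-base fragment is an invented stand-in, not any sequence disclosed in this application, and the function names are illustrative.

```python
# Translation table for Watson-Crick complements (DNA alphabet only).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string containing A/C/G/T."""
    return dna.translate(COMPLEMENT)[::-1]


def transcribe(coding_strand):
    """Return the RNA with the same sense as the given DNA strand (T -> U)."""
    return coding_strand.upper().replace("T", "U")


# Invented fragment standing in for part of a desaturase coding sequence.
sense_fragment = "ATGGTTGCTATGGACCAG"

# Cloning the fragment in reverse orientation behind the promoter places its
# reverse complement on the strand that has the same sense as the transcript.
antisense_insert = reverse_complement(sense_fragment)

sense_rna = transcribe(sense_fragment)        # stands in for the target mRNA
antisense_rna = transcribe(antisense_insert)  # RNA made from the antisense construct

print(sense_rna)
print(antisense_rna)
```

Because the antisense RNA is complementary to the sense RNA along its whole length, the two can base-pair, which is the property an antisense construct relies on.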

There has also been provided, in accordance with another aspect
of the present invention, a method of producing a genetically transformed
plant which has an elevated or reduced linolenic acid content. There has
also been provided, in accordance with another aspect of the present
invention, a recombinant, double-stranded DNA molecule and plant cells
containing a recombinant, double-stranded DNA molecule.

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 shows the genetic map of the region of chromosome 2 of
Arabidopsis thaliana where a linoleic acid desaturase gene is located and
the identity of the yeast artificial chromosomes which carry this region of
the genome.

Figure 2 shows the structure of plasmid pBNDES3 which was
obtained by inserting an EcoRI fragment containing the B. napus linoleic
acid desaturase cDNA (fad3) into pBLUESCRIPT.

Figure 3 shows the nucleotide sequence (SEQ ID NO:1) and
deduced amino acid sequence (SEQ ID NO:2) for the linoleic acid desaturase
cDNA (fad3) from B. napus.

Figure 4 shows a comparison of the deduced amino acid sequence
of one linoleic acid desaturase cDNA (fad3) from B. napus and the desA
gene from Synechocystis. Identical residues are indicated by a solid box.
Conservative substitutions are indicated by a stippled box.

Figure 5 shows the binary Ti plasmid vector pBI121. 

Figure 6 shows the binary Ti plasmid pTiDES3 which was
constructed by insertion of a linoleic acid desaturase cDNA (fad3) into
pBI121.

Figure 7 shows the map of the plant transformation vector 
pMON13804. 

Figure 8 shows the map of the plant transformation vector
pMON13805.

Figure 9 shows the oil content of control and transformed canola 
seed in accordance with the present invention. 

Figure 10 shows the nucleotide sequence (SEQ ID NO:9) for the
linoleic acid desaturase cDNA (fadD) from Arabidopsis.

Figure 11 shows the deduced amino acid sequence (SEQ ID
NO:10) for the linoleic acid desaturase cDNA (fadD) from Arabidopsis.

Figure 12 shows the nucleotide sequence (SEQ ID NO:11) for the
linoleic acid desaturase cDNA (fadE) from Arabidopsis.

Figure 13 shows the deduced amino acid sequence (SEQ ID
NO:12) for the linoleic acid desaturase cDNA (fadE) from Arabidopsis.

DETAILED DESCRIPTION OF THE INVENTION

A genetically transformed plant of the present invention which
has an altered linolenic or linoleic acid content can be obtained by
expressing the double-stranded DNA molecules described in this
application.

The expression of a double-stranded DNA involves transcription
of messenger RNA (mRNA) from one strand of the DNA by RNA
polymerase enzyme, and the subsequent processing of the mRNA primary
transcript inside the nucleus. This processing involves a 3' non-translated
region which adds polyadenylate nucleotides to the 3' end of the RNA.

Promoters

Transcription of DNA into mRNA is regulated by a region of
DNA usually referred to as the "promoter." The promoter region contains a
sequence of bases that signals RNA polymerase to associate with the
DNA, and to initiate the transcription of mRNA using one of the DNA
strands as a template to make a corresponding complementary strand of
RNA.

Any promoter which is known or is found to cause transcription
of RNA in plant cells can be used in the present invention. Promoters
which are useful in the present invention include any promoter that
functions in a plant cell to cause the production of an RNA sequence. A
number of promoters which are active in plant cells and are capable of
producing an RNA sequence have been described in the literature. These
include the nopaline synthase (NOS) and octopine synthase (OCS)
promoters (which are carried on tumor-inducing plasmids of Agrobacterium
tumefaciens), the caulimovirus promoters such as the cauliflower mosaic
virus (CaMV) 19S and 35S and the figwort mosaic virus 35S promoters,
the light-inducible promoter from the small subunit of ribulose-1,5-bis-
phosphate carboxylase (ssRUBISCO, a very abundant plant polypeptide),
and the chlorophyll a/b binding protein gene promoter, etc. All of these
promoters have been used to create various types of DNA constructs
which have been expressed in plants; see, e.g., PCT publication WO
84/02913 (Rogers et al., Monsanto).

Promoters may be obtained from a variety of sources such as
plants and plant viruses. Promoters can be used in the form that they
exist as isolated from plant genes such as ssRUBISCO genes, or can be
modified to improve their effectiveness, such as with the enhanced
CaMV35S promoter.

Those skilled in the art will recognize that the amount of linoleic
acid desaturase needed to induce the desired alteration in linolenic acid
content may vary with the type of plant. It is also possible that extremes
in linoleic acid desaturase activity may be deleterious to the plant.
Therefore, in a preferred embodiment, promoter function should be
optimized by selecting a promoter with the desired tissue expression
capabilities and approximate promoter strength and selecting a
transformant which produces the desired linoleic acid desaturase activity in
the target tissues.

This selection approach from the pool of transformants is
routinely employed in expression of heterologous structural genes in plants
since there is variation between transformants containing the same
heterologous gene due to the site of gene insertion within the plant genome
(commonly referred to as "position effect").

In a preferred embodiment, the promoters utilized in the double-
stranded DNA molecules should have relatively high expression in tissues
where the increased or decreased linolenic acid content is desired, such as
the seeds of the plant. In canola, a particularly preferred promoter in this
regard is the seed specific promoter described herein in greater detail in the
accompanying examples.

In another preferred embodiment, the promoter used in the
expression of the double-stranded DNA molecules of the present invention
can be a constitutive promoter, expressing the DNA molecule in all or most
of the tissues of the plant. However, the promoter selected for this
embodiment should not cause expression at levels which are detrimental
to plant health, growth and development.

β-conglycinin (also known as the 7S protein) is one of the major
storage proteins in soybean (Glycine max) (Meinke et al., 1981). The 7S
(β-conglycinin) α-subunit promoter, used in one aspect of this study to express
the linoleic acid desaturase gene, has been shown to be both highly active
and seed-specific (Doyle et al., 1986 and Beachy et al., 1985). The β-subunit
of β-conglycinin has been expressed, using its endogenous promoter, in the
seeds of transgenic petunia and tobacco, showing that the promoter
functions in a seed-specific manner in other plants (Bray et al., 1987). The
promoter for β-conglycinin could be used in accordance with the present
invention. If used, this promoter could express the DNA molecule
specifically in seeds, which could lead to an alteration in the linolenic acid
content of the seeds.

In addition, the endogenous plant linoleic acid desaturase
promoters can be used in the present invention. These promoters should be
useful in expressing a linoleic acid desaturase gene in specific tissues, such
as leaves, seeds or fruits. A number of other promoters with seed-specific
or seed-enhanced expression are known and are likely to be expressed in
seeds, which are oil accumulating cells. For illustration, the napin promoter
and the acyl carrier protein promoters have been utilized in the
modification of seed oil by antisense expression (Knutson et al., 1992).

The linolenic acid content of root tissue can be increased by
expressing a linoleic acid desaturase gene behind a promoter which is
expressed in roots. The promoter from the acid chitinase gene (Samac et
al., 1990) is known to function in root tissue and could be used to express
the linoleic acid desaturase in root tissue. Expression in root tissue could
also be accomplished by utilizing the root specific subdomains of the
CaMV35S promoter that have been identified (Benfey et al., 1989). The
linolenic acid content of leaf tissue can be increased by expressing the
linoleic acid desaturase gene using a leaf active promoter such as the
ssRUBISCO promoter or chlorophyll a/b binding protein gene promoter.

The linolenic acid content of fruits can be increased by
expressing a linoleic acid desaturase gene behind a promoter which is
functional in fruits. Such promoters could be either expressed at all
developmental stages of the fruit or restricted to specific stages,
particularly fruit ripening.

The RNA produced by a DNA construct of the present invention
can also contain a 5' non-translated leader sequence. This sequence can be
derived from the promoter selected to express the gene, and can be
specifically modified so as to increase translation of the mRNA. The 5'
non-translated regions can also be obtained from viral RNAs, from suitable
eukaryotic genes, or from a synthetic gene sequence. The present
invention is not limited to constructs, as presented in the following
examples, wherein the non-translated region is derived from the 5' non-
translated sequence that accompanies the promoter sequence. Rather, the
non-translated leader sequence can be derived from an unrelated promoter
or coding sequence as discussed above.

Linoleic Acid Desaturase Structural Coding Sequences

The structural coding sequence that causes the production of an
RNA sequence that encodes a linoleic acid desaturase activity can be the
sequences disclosed in the present application, or any sequence that can be
obtained using the sequences disclosed in the present application, or any
sequence that can be isolated using the method disclosed in the present
application.

The structural coding sequence can also be a part of, or derived from, the
structural coding sequences disclosed in the present invention. It is possible
that the active part of the linoleic acid desaturase is formed using only part
of the structural coding sequences disclosed in the present application.

The structural coding sequences can be obtained from a variety
of sources, such as algae, bacteria or plants. Preferably, structural coding
sequences obtained from plants are used in accordance with the present
invention.

Since virtually nothing was known about the properties of the
linoleic acid desaturase structural coding sequence prior to the present
invention, the method used in the present invention to isolate the structural
coding sequence was based on the concept of map based cloning. The
essential concept in map based cloning is to use information about the
genetic map position of a structural coding sequence to isolate the region of
the chromosome surrounding the structural coding sequence, and then to
use the isolated DNA to complement a mutation in the structural coding
sequence. This strategy has never previously been reported in the isolation
of any plant gene.

In order to implement map based cloning of the linoleic acid
desaturase, mutants of Arabidopsis thaliana (L.) deficient in linoleic acid
desaturase activity were isolated by screening randomly chosen individuals
from mutagenized populations of plants for individual plants with altered
leaf or seed fatty acid composition (Browse et al. 1985; Lemieux et al.
1990). By screening thousands of plants for altered fatty acid composition,
mutants with decreased amounts of linolenic acid and increased amounts of
linoleic acid in leaf and seed lipids were isolated. Physiological and genetic
analyses of these mutants indicated that they fell into three
complementation groups designated fad3, fadD and fadE.
The fad3 mutants had very reduced levels of linolenic acid in
seeds and roots but had almost normal levels of linolenic acid in leaves.
This effect was interpreted as evidence that the fad3 locus encoded a
microsomal desaturase which was responsible for desaturation of linoleic
acid to linolenic acid on lipids made by the pathway of lipid biosynthesis in
the endoplasmic reticulum, designated the "eukaryotic pathway" (Lemieux
et al. 1990). This pathway is mostly responsible for the synthesis of lipids
in non-green tissues such as seeds and roots, but plays a secondary role in
leaves and other green tissues. Thus, a mutation in the fad3 gene would not
be expected to have a major effect on the desaturation of leaf lipids.

In contrast to the fad3 mutant, the fadD mutant had almost
normal fatty acid composition of roots and seeds, but had a strong
reduction in the amount of linolenic acid in leaf lipids, and a corresponding
increase in the amount of linoleic acid (Browse et al., 1986). Thus, this
mutant had the properties expected of a mutant deficient in a linoleic acid
desaturase from the prokaryotic pathway which is primarily responsible
for the synthesis of lipids in green tissues.

An unusual property of the fadD mutants was that they were
very deficient in linolenic acid content when grown at temperatures above
about 22 °C but had almost normal fatty acid composition when grown at
temperatures below about 18 °C (McCourt et al., 1987). Since it was very
unlikely that several independently isolated mutations would all give rise to
a temperature conditional phenotype, it was concluded that a second
desaturase must be partially responsible for desaturating linoleic acid to
linolenic acid in green tissues. Therefore, the fadD mutant was
remutagenized with ethylmethane sulfonate, self-fertilized to produce a
segregating population of mutagenized plants (designated the M2
generation), and this population was screened for a mutant which was
deficient in linolenic acid in green tissues at low temperatures. A mutant
with this property was isolated and the mutation responsible for this effect
was designated the fadE locus (Somerville and Browse, unpublished).

Isolation of the Linoleic Acid Desaturase Gene from Canola

The following example was used to isolate the structural coding
sequence from the fad3 region. The method described herein could equally
have been used to isolate either the fadD or fadE region.

In order to approximately locate the fad3 mutation on the genetic
map of Arabidopsis, a sexual cross was made between the fad3 mutant line
BL1 and the multiply marked mutant line W1 (Hugly et al., 1991). The F1
hybrids from this cross were permitted to self-fertilize and the resulting F2
plants were scored for both the segregating genetic markers and the altered
fatty acid composition. The results of this analysis indicated that the fad3
mutation was located on chromosome 2 near the marker erecta. In order
to obtain a more accurate map position by RFLP mapping, a second sexual
cross was made between the fad3 mutant line BL1 and the Niederzenz
race of Arabidopsis. The F1 progeny were permitted to self-fertilize to
produce the F2 generation. 137 F2 plants were grown for 3 weeks at 22 °C
(100 µE/m2/s) in order to produce fully expanded rosettes, and a few
leaves (representing a total weight of 0.2-0.5 g per plant) were harvested
from each plant in order to prepare DNA from them.

The leaves were frozen in liquid nitrogen, and ground in dry ice,
using a mortar and a pestle. For each sample, the frozen powder was
transferred to a microfuge tube and an equal amount of 2X CTAB buffer
(2% cetyltrimethyl ammonium bromide (CTAB), 100 mM Tris-HCl pH 8,
20 mM EDTA, 1.4 M NaCl, 1% polyvinylpolypyrrolidone (PVP) 40,000) was
added. The tubes were left at room temperature for 5 min to allow the
powder to thaw. The homogenate was extracted once with a mixture of
chloroform-isoamyl alcohol (24:1, v/v), and 1/10 vol of 10X CTAB (10%
CTAB, 0.7 M NaCl) buffer was added to the aqueous phase, which was then
reextracted with an equal volume of chloroform-isoamyl alcohol (24:1, v/v).
The aqueous phase was transferred to a fresh microfuge tube and 1.5 vol of
CTAB precipitation buffer (1% CTAB, 50 mM Tris-HCl pH 8, 10 mM
EDTA) was added. The DNA was allowed to precipitate for 12 hr at 4 °C,
and collected by centrifugation (5 min at 10,000 x g). The DNA was
resuspended in 100 µl of 10 mM Tris-HCl pH 7.5, 1 mM EDTA, 1 M NaCl,
and 100 µg/ml RNase A and incubated at 50 °C for 30 min. The DNA was
precipitated by adding 2.2 vol of ethanol and incubating on ice for 20 min.
The DNA was collected by centrifugation and the pellet was washed once
with 1 ml of 70% ethanol, dried under vacuum for 3 min and resuspended in
10 µl of distilled water. The DNA was stored at -20 °C until use.

The 137 plants were grown to maturity and their seeds were
collected individually. The fatty acid composition of 10 individual seeds
from each of the F2 plants was measured as described by Browse et al.
(1986) in order to score the fad3 phenotype of each plant. Each seed was
incubated in 1 ml of 1 N HCl in methanol for 1 h at 80 °C. The tubes were
cooled to room temperature and 1 ml of 0.9% NaCl plus 0.3 ml of hexane
were added. The tubes were agitated by vortexing and the phases separated
by centrifugation (300 x g for 5 min). The hexane phase was saved,
evaporated under a stream of nitrogen, and the fatty acid methyl esters
were dissolved in 50 µl hexane. An aliquot (2 µl) was injected onto the gas
chromatograph and the fatty acid methyl esters separated and quantitated
by flame ionization as described (Browse et al., 1986).

The DNA samples (1 µg) were then cut with the appropriate
restriction enzyme (EcoRI for the marker 220, Bgl2 for the marker
ASA2) using 1X KGB buffer (Sambrook et al., 1989), 5
units of the restriction endonuclease and 100 ng/ml BSA. The volume of
each sample was 10 µl and the incubation was performed at 37°C for 4 h.
The fragments were resolved by agarose gel electrophoresis (0.8% agarose
in 1X TAE buffer; Sambrook et al., 1989) and transferred to nylon filters
(Hybond N+), using the alkaline transfer method as described by the
manufacturer. The nylon filters were probed (according to Church and
Gilbert, 1984) with radioactively labelled fragments of DNA (Sambrook et
al., 1989) corresponding to known RFLP markers which had previously
been mapped in the approximate vicinity of the fad3 locus on chromosome
2. The RFLP markers 220 (Chang et al., 1988) and ASA2 were found to
map close to the fad3 locus. Analysis of the pattern of recombinants
(Table 1) indicated that both ASA2 and 220 were located on the same side
of the fad3 locus, at distances of 0.4 and 2.2 centimorgans (cM),
respectively.

Table 1

# of plants    220    ASA2    fad3

67             H      H       +/-
30             L      L       -/-
34             N      N       +/+
3              H      N       +/+
1              L      H       +/-
1              N      H       +/-
1              H      H       -/-



Table 1 shows the genotype of the F2 plants used for mapping
the fad3 locus. L is for Landsberg (background of the fad3 mutant), N is
for Niederzenz, H for heterozygous. A total of 137 F2 plants were analyzed.
The number of recombinant plants between fad3 and 220 or ASA2 was 6
and 1, respectively.
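
The map distances quoted in the text can be related to the recombinant counts in Table 1. The following is a minimal sketch, assuming that each recombinant F2 plant carries a single recombinant chromosome, so that the 137 F2 plants represent 274 scored chromosomes. That assumption and the function name are introduced here for illustration and are not stated in the source.

```python
def map_distance_cm(recombinant_plants: int, f2_plants: int) -> float:
    """Approximate map distance in cM from F2 recombinant counts.

    Assumes one recombinant chromosome per recombinant plant, so the
    denominator is the number of chromosomes scored (2 per F2 plant).
    """
    chromosomes_scored = 2 * f2_plants
    return 100.0 * recombinant_plants / chromosomes_scored


F2_PLANTS = 137  # total F2 plants analyzed (Table 1)

distance_220 = map_distance_cm(6, F2_PLANTS)   # fad3 to marker 220
distance_asa2 = map_distance_cm(1, F2_PLANTS)  # fad3 to marker ASA2

print(f"fad3 to 220:  {distance_220:.1f} cM")
print(f"fad3 to ASA2: {distance_asa2:.1f} cM")
```

Under these assumptions the values round to 2.2 cM and 0.4 cM, which agrees with the distances reported for 220 and ASA2.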

In order to isolate the region of the chromosome containing the
fad3 locus, the RFLP markers 220 and ASA2 were used as hybridization
probes to screen several yeast artificial chromosome (YAC) libraries (Grill
and Somerville, 1991; Ward and Jen, 1990). The YAC filters were prepared
according to Grill and Somerville (1991). The library was replicated onto
nylon filters placed on petri dishes of SC- medium (synthetic complete
medium minus tryptophan and uracil; Sherman et al., 1986). The cells were
allowed to grow for 12 h at 30°C, and the filters were transferred for
15 min onto Whatman 3MM paper saturated with 1 M sorbitol, 50 mM DTT,
50 mM EDTA (pH 8).

The yeast cell walls were then digested with lyticase by
incubating the filters on Whatman paper saturated with 1 M sorbitol, 50
mM EDTA and 2 mg/ml lyticase (Sigma Co., St. Louis, MO) for 12 h at
30°C. The filters were then transferred onto Whatman 3MM paper
saturated with 0.5 M NaOH, 1.5 M NaCl for 15 min, neutralized with 0.5 M
Tris-HCl pH 8 for 15 min and quickly rinsed in 2X SSC (SSC is 10 mM
sodium citrate, 150 mM NaCl, pH 7). The filters were allowed to dry and
were transferred to a vacuum oven at 80°C for 1 h. They were
subsequently hybridized according to Church and Gilbert (1984) with
probes labelled with 32P according to Sambrook et al. (1989).

The DNA of RFLP probe 220 was prepared from 100 ml of liquid
culture lysate using the LambdaSorb procedure (Promega Corp., Madison,
WI); the cDNA encoding ASA2 was excised from the original plasmid
(pKN140C; obtained from Dr. G. Fink, Whitehead Institute, Cambridge,
MA) with Hind3 and cloned into the Hind3 site of pBLUESCRIPT. The
plasmid DNA was then purified on cesium chloride gradients according to
Sambrook et al. (1989) and digested with Hind3, and the DNA insert was gel
purified twice by electroelution according to Sambrook et al. (1989).

In order to probe the libraries, the whole DNA from RFLP220
was used as a hybridization probe. By contrast, only the DNA insert of
ASA2 was used as a probe. The RFLP probe 220 hybridized to YACs
EG4E8 and EG9D12. The probe ASA2 hybridized to YACs EW15G1,
EW15B4 and EW7D11.

In order to determine if these YACs contained all of the DNA
between RFLP220 and ASA2, small regions of DNA from the ends of the
inserts in EG4E8 and EW15G1 were prepared by inverse PCR (Grill and
Somerville, 1991). For that purpose, DNA was prepared from the
appropriate YAC clones. The clones (single colonies) were grown to
saturation in SC- liquid cultures, and 1 ml of these cultures was used to
inoculate 40 ml liquid cultures (in SC- medium) that were allowed to grow
for 16 h at 30°C. The cells were collected by centrifugation, washed once in
1 M sorbitol, 50 mM EDTA, resuspended in 200 µl of 1 M sorbitol, 50 mM
EDTA, 100 mM sodium citrate pH 5.8, 2 mM β-mercaptoethanol and 2
mg/ml lyticase, and incubated for 2 h at 30°C.

Next, 350 µl of 2X CTAB buffer was added and the DNA was
purified as described above. DNA (5 µg) of each clone was digested
separately with HincII, AluI, EcoRV and RsaI (in 1X KGB buffer, at 37°C
for 4 h; final volume: 50 µl). The reactions were stopped by heating at
65°C for 15 min, extracted once with one volume of phenol saturated with TE
pH 8, followed by an extraction with 1 volume of chloroform-isoamyl
alcohol mixture (24:1, vol/vol). The DNA was recovered by ethanol
precipitation and resuspended in sterile distilled water. The ligation
reactions were performed using 300 ng of DNA in a final volume of 50 µl.
The reactions were carried out in 50 mM Tris-HCl pH 7.4, 10 mM MgCl2, 1
mM DTT, 1.2 mM ATP with 1 U of ligase, for 2 h at 20°C, and stopped by
heating at 68°C for 30 min.

The PCR reactions were carried out as follows. The buffers used
were those indicated by the suppliers, except for the Perkin Elmer
enzyme, for which the reaction was supplemented with an additional 1.4
mM MgCl2 (final concentration 2.9 mM Mg). The final dNTP concentration
was 125 µM when the Perkin Elmer enzyme was used and 200 µM with the
Taq polymerases from other sources. In all cases, 100 ng of each
oligonucleotide was used. The final volume was 100 µl. When no product
was obtained, the reactions were carried out again under the same
conditions except that formamide was added to a final concentration of 3%.

The left end was amplified from the ligation products of the
EcoRV and RsaI digests, using the oligonucleotides EG1
(GGCGATGCTGTCGGAATGGACGATA) (SEQ. ID NO. 3) and EG2
(CTTGGAGCCACTATCGACTACGCGATC) (SEQ. ID NO. 4).

The right end of the clones obtained from the EG library was
amplified from the ligation products of the AluI and HincII digests, using
the oligonucleotides EG3 (CCGATCTCAAGATTACGGAAT) (SEQ. ID NO.
5) and EG4 (TTCCTAATGCAGGAGTCGCATAAG) (SEQ. ID NO. 6).

The right end of the clones obtained from the EW YAC library
was amplified using the oligonucleotides H1 (AGGAGTCGCATAAGGGAG)
(SEQ. ID NO. 7) and H2 (GGGAAGTGAATCJGAGAC) (SEQ. ID NO. 8),
using the same cycle conditions as above, except that the annealing
temperature was reduced to 50°C.
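
As an illustration only, the length, GC content and a rough melting temperature of the listed oligonucleotides can be tabulated with a few lines of Python. The sketch uses the Wallace rule (2°C per A/T, 4°C per G/C), a coarse approximation intended for short oligonucleotides and not a method described in the source. H2 is left out because its sequence as printed contains a character that is not a nucleotide. The helper names are hypothetical.

```python
# Primer sequences as listed in the text (H2 omitted, see note above).
PRIMERS = {
    "EG1": "GGCGATGCTGTCGGAATGGACGATA",
    "EG2": "CTTGGAGCCACTATCGACTACGCGATC",
    "EG3": "CCGATCTCAAGATTACGGAAT",
    "EG4": "TTCCTAATGCAGGAGTCGCATAAG",
    "H1": "AGGAGTCGCATAAGGGAG",
}


def _validated(seq: str) -> str:
    """Upper-case the sequence and reject anything other than A, C, G, T."""
    seq = seq.upper()
    if set(seq) - set("ACGT"):
        raise ValueError(f"non-nucleotide character in {seq!r}")
    return seq


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = _validated(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq: str) -> int:
    """Rough Tm in degrees C by the Wallace rule: 2*(A+T) + 4*(G+C)."""
    seq = _validated(seq)
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


for name, seq in PRIMERS.items():
    print(f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.0%}, "
          f"Wallace Tm about {wallace_tm(seq)} C")
```

The estimate is only a guide for comparing primers of different lengths; actual annealing temperatures are chosen empirically, as in the text.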

After the reactions were completed, 5 µl of each mixture was
electrophoresed on an agarose gel to separate the amplification product
from primers. The slice of agarose that contained the amplified band was
excised from the gel and melted in 1 ml of distilled water. Large amounts of
product could then be produced by reamplification of 5 µl of the melted
slice. The PCR products were then purified by electroelution or by using
GeneClean (Bio101) and used as hybridization probes to probe filters
containing the isolated YAC DNA restricted by several enzymes. The
probe made from the right end of EW15G1 hybridized to EG4E8 and,
similarly, a probe from the right end of EG4E8 hybridized to EW15G1.
Thus, it was concluded that the YACs EG4E8 and EW15G1 contained all of
the DNA in the region of the chromosome between RFLP220 and ASA2.

The size of the YAC clones was estimated by field inversion
electrophoresis (CHEF, Vollrath and Davis, 1987). High molecular weight
DNA was prepared as follows: the yeast cells which contained the YAC
clones were grown and treated with lyticase as described above for
preparing DNA. The spheroplasts were then resuspended in an equal
volume of 1 M sorbitol, 50 mM EDTA, 1% low melt agarose at 37°C. The
mixture was poured into a mould (Biorad) which was set on ice to allow the
agarose to harden.

The resulting plugs were incubated for 12 h in 0.5 M EDTA pH 9,
1% lauryl sarcosine, 1 mg/ml Proteinase K at 50°C. The plugs were
subsequently washed twice in 50 mM EDTA and stored at 4°C until use.
The CHEF gel was run in 1X TBE for 16 h at 200 V, with a switching
interval of 20 s; the temperature of the buffer was maintained at 14°C
during the run. The sizes of the YACs were determined by comparison with
a lambda ladder and the yeast chromosomes, and were as follows: EG4E8,
90 kb; EG9D12, 190 kb; EW15G1, 90 kb; EW15B4, 70 kb; EW7D11, 125
kb. These sizes permitted us to roughly determine a correspondence
between physical and genetic distances: the distance that separates 220
from ASA2 cannot exceed 180 kb, the sum of the sizes of the 2 YACs
EG4E8 and EW15G1. Since the corresponding genetic distance is 1.7 cM,
one can roughly estimate that, in this particular cross and in this particular
region of the genome, the value of 1 cM is close to 100 kb. Thus, since the
fad3 gene maps only 0.4 cM away from ASA2, the corresponding physical
distance should be close to 40 kb. We then concluded that fad3 was
probably located on the YAC EW7D11, which is the largest YAC
hybridizing with ASA2. See Figure 1.
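
The arithmetic behind this estimate can be written out as a short sketch. The variable names are illustrative, and the rounding of the upper bound to 100 kb per cM follows the text rather than any exact calculation.

```python
# Sizes (kb) of the two overlapping YACs that span the 220-ASA2 interval.
yac_sizes_kb = {"EG4E8": 90, "EW15G1": 90}

# The interval cannot be longer than the two YACs laid end to end.
max_interval_kb = sum(yac_sizes_kb.values())

genetic_interval_cm = 1.7  # genetic distance between 220 and ASA2 (from the text)

# Upper bound on the physical length of 1 cM in this region.
kb_per_cm_upper = max_interval_kb / genetic_interval_cm

# The text rounds this to roughly 100 kb per cM.
kb_per_cm_rounded = 100

# Estimated physical distance from ASA2 to fad3 (0.4 cM apart).
fad3_to_asa2_kb = 0.4 * kb_per_cm_rounded

print(f"Maximum 220-ASA2 interval: {max_interval_kb} kb")
print(f"Upper bound: {kb_per_cm_upper:.0f} kb per cM")
print(f"Estimated ASA2-fad3 distance: about {fad3_to_asa2_kb:.0f} kb")
```

Because 180 kb is an upper limit on the interval, the figure of about 40 kb is likewise an upper-end estimate, which is why the 125 kb YAC EW7D11 was the most likely candidate to carry fad3.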
In order to test the possibility that the YAC EW7D11 carried
the fad3 gene, the YAC was used to probe a cDNA library made from
developing seeds of canola (Brassica napus L.). Even though the YAC was
isolated from Arabidopsis, the fact that Arabidopsis and B. napus are both
members of the family Cruciferae led us to predict that the homologous
genes from these two species would be sufficiently identical at the
nucleotide sequence level that the Arabidopsis gene would hybridize to
the B. napus gene. We also assumed that, because the fad3 gene product
catalyzes a reaction chemically similar to that of the stearoyl-ACP
desaturase, the gene would be expressed at similar moderately high levels
in developing seeds (Shanklin and Somerville, 1991). Since EW7D11
contained only about 0.2% of the total genome, we expected it to contain
only about 2 moderately abundantly expressed genes (i.e., genes for which
the mRNA is between 0.1 and 0.01% of total mRNA).

DNA of YAC EW7D11 was isolated as follows: high molecular
weight DNA was prepared from the yeast cells that contained the YAC
EW7D11 as described above, and several preparative low-melt agarose
CHEF gels were run in 1X TBC buffer (same as TBE except that CDTA
was substituted for EDTA). The slices that contained the YAC were excised
from the gels and pooled. Three slices were melted at 65°C and extracted
with an equal volume of phenol saturated with TE. The aqueous phase was
saved and reduced to 0.5 ml by repeated extractions with isobutyl alcohol.
The remaining agarose was removed by several phenol extractions, followed
by two chloroform-isoamyl alcohol extractions. The DNA was precipitated
by adding 2 µg of linear acrylamide as a carrier plus 10 µl of 5 M NaCl and
1.1 ml of ethanol, and incubating for 20 min at 0°C. The DNA pellet was
recovered by centrifugation, washed in 70% ethanol, dried under vacuum
and resuspended in 50 µl of distilled water. The DNA (50 ng) was
radioactively labelled and used to probe a cDNA library in λgt11.
The nitrocellulose filters were processed as described in
Sambrook et al. (1989). Duplicate filters were used, and the films were
exposed for 5-7 days in order to obtain a good signal. From among 200,000
plaques screened in this way, 31 hybridized to EW7D11. Among these 31
clones, 17 were homologous to each other, as checked by cross
hybridization under stringent conditions. The size of the inserts in the 17
clones was estimated and the clone with the largest cDNA was retained for
further analysis. A small scale preparation of this phage was made
using the LambdaSorb method, and the insert was excised by restricting
with EcoRI. This insert was ligated into a pBLUESCRIPT II vector
linearized with EcoRI, and the ligation mixture was used to transform E.
coli strain DH5α.
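
The screening numbers can be expressed as clone frequencies. The sketch below treats the fraction of plaques in the library as a rough proxy for the abundance of the corresponding mRNA, which is an assumption made here for illustration and is not claimed by the source.

```python
plaques_screened = 200_000  # total plaques screened with the EW7D11 probe
hybridizing_plaques = 31    # plaques that hybridized to EW7D11
homologous_clones = 17      # clones that cross-hybridized with each other

# Frequencies as a percentage of all plaques screened.
pct_hybridizing = 100 * hybridizing_plaques / plaques_screened
pct_homologous = 100 * homologous_clones / plaques_screened

print(f"All hybridizing plaques: {pct_hybridizing:.4f}% of the library")
print(f"Homologous group of 17:  {pct_homologous:.4f}% of the library")
```

Under that assumption the group of 17 clones corresponds to about 0.0085% of the library, near the lower end of the 0.01-0.1% range that the text uses to define a moderately abundant message.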

One of the recombinant clones was designated pBNDES3
(Figure 2), and retained for sequencing. The sequence was determined on
both strands, using the Sequenase enzyme (US Biochemicals, Cleveland,
OH) according to the instructions provided by the supplier. The nucleotide
sequence of the insert in pBNDES3 is presented as Figure 3. The deduced
amino acid sequence of the largest open reading frame in the nucleotide
sequence is also shown in Figure 3.

Comparison of the deduced amino acid sequence of the 383
amino acid open reading frame in clone pBNDES3 against the known
sequences in GenBank release 70 was performed using the FASTA
program (Lipman and Pearson, 1985). This analysis revealed that the
sequence from pBNDES3 had a region of significant homology to a
previously characterized desaturase gene from the cyanobacterium
Synechocystis (Figure 4) (Wada et al., 1990). This was considered
suggestive evidence that the clone pBNDES3 encoded a desaturase which
was probably the product of the fad3 structural coding sequence. This was
subsequently confirmed by a genetic complementation experiment.
The cDNA was cloned into plant transformation vector pBI121
(Figure 5) under the control of the CaMV35S promoter to construct
pTiDES3 (Figure 6). Plasmid pTiDES3 was introduced into an
Agrobacterium tumefaciens strain which also carried an Ri plasmid, and this
was used to produce transgenic rooty tumors from both wild type
Arabidopsis and the fad3 mutant. Transgenic tissue was selected for
antibiotic resistance to confirm the presence of pTiDES3. Fatty acid
methyl esters were then prepared and examined by gas chromatography to
determine the profile of fatty acids being produced in the tissue. The levels
of linolenic acid increased, demonstrating that the cDNA on pTiDES3 can
complement the fad3 mutation. These results, which are described in detail
in Example 1 below, confirm the identity of the cDNA as encoding a linoleic
acid desaturase.

The isolation of a plant structural coding sequence provides
those skilled in the art with a tool for the manipulation of gene expression
by the mechanism of antisense RNA. The technique of antisense RNA is
based upon introduction of a chimeric gene which will produce an RNA
transcript that is complementary to a target gene (reviewed in Bird and
Ray, 1991). The resulting phenotype is a reduction in the gene product
from the endogenous gene. The portion of the gene which is sufficient for
achieving the antisense effect is variable in that numerous fragments or
combinations thereof are likely to be effective. Various portions of the
structural coding sequence of linoleic acid desaturase, isolated either from
cDNA or genomic clones, are likely capable of reducing linolenic acid levels
in plants by reducing linoleic acid desaturase levels. An example
of using an antisense oriented linoleic acid desaturase structural coding
sequence is set out in Example 2.
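
The complementarity on which the antisense approach relies can be illustrated with a minimal sketch. The nine-base sequence below is a hypothetical toy fragment, not part of the fad3 sequence, and the function names are introduced here for illustration.

```python
# Translation table mapping each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def sense_rna(coding_strand: str) -> str:
    """mRNA read from the coding strand: same sequence with U in place of T."""
    return coding_strand.upper().replace("T", "U")


def antisense_rna(coding_strand: str) -> str:
    """Transcript produced when the coding sequence is placed in reverse
    orientation behind a promoter: the reverse complement, written as RNA."""
    return reverse_complement(coding_strand).replace("T", "U")


toy_fragment = "ATGGTTGCA"  # hypothetical coding-strand fragment

print("sense RNA:    ", sense_rna(toy_fragment))
print("antisense RNA:", antisense_rna(toy_fragment))
```

Read 5' to 3', the antisense transcript pairs base for base with the sense transcript, which is the property the antisense construct in Example 2 depends on.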
Polyadenylation Signal

The 3' non-translated region of the double-stranded DNA
molecule of the present invention contains a region that functions in plant
cells to promote polyadenylation of the 3' end of the RNA sequence. Any
such region can be used within the scope of the present invention.
Examples of suitable 3' regions are (1) the 3' transcribed, non-translated
regions containing the polyadenylation signal of Agrobacterium
tumor-inducing (Ti) plasmid genes, such as the nopaline synthase (NOS)
gene, and (2) 3' regions of plant genes like the soybean storage protein
genes and the small subunit of the ribulose-1,5-bisphosphate carboxylase
(ssRUBISCO) gene. An example of a preferred 3' region is that from the
NOS gene, described in greater detail in the examples below.

Plant Transformation/Regeneration

Any plant which can be transformed to contain the double-stranded
DNA molecule of the present invention is included within the
scope of this invention. Preferred plants which can be made to have
increased or decreased linolenic acid content by practice of the present
invention include, but are not limited to, sunflower, safflower, cotton, corn,
wheat, rice, peanut, canola/oilseed rape, barley, sorghum, soybean, flax,
tomato, almond, cashew and walnut.

A double-stranded DNA molecule of the present invention
containing the functional plant linoleic acid desaturase gene can be inserted
into the genome of a plant by any suitable method. Suitable plant
transformation vectors include those derived from a Ti plasmid of
Agrobacterium tumefaciens, as well as those disclosed, e.g., by Herrera-
Estrella (1983), Bevan (1984), Klee (1985) and EPO publication 120,516
(Schilperoort et al.). In addition to plant transformation vectors derived
from the Ti or root-inducing (Ri) plasmids of Agrobacterium, alternative
methods can be used to insert the DNA constructs of this invention into
plant cells. Such methods can involve, for example, the use of liposomes,
electroporation, chemicals that increase free DNA uptake, free DNA
delivery via microprojectile bombardment, and transformation using
bacteria, viruses or pollen.

A plasmid expression vector suitable for the expression of the
linoleic acid desaturase gene in monocots is composed of the following: a
promoter that is specific or enhanced for expression in the lipid storage
tissues and a 3' polyadenylation sequence such as the nopaline synthase 3'
sequence (NOS 3'; Fraley et al., 1983). This expression cassette may be
assembled on high copy replicons suitable for the production of large
quantities of DNA.

A particularly useful Agrobacterium-based plant transformation
vector for use in transformation of dicotyledonous plants is plasmid vector
pMON530 (Rogers, S.G., 1987). Plasmid pMON530 (see Figure 7) is a
derivative of pMON505 prepared by transferring the 2.3 kb StuI-HindIII
fragment of pMON316 (Rogers, S.G., 1987) into pMON526. Plasmid
pMON526 is a simple derivative of pMON505 in which the SmaI site is
removed by digestion with XmaI, treatment with Klenow polymerase and
ligation. Plasmid pMON530 retains all the properties of pMON505 and the
CaMV35S-NOS expression cassette and now contains a unique cleavage
site for SmaI between the promoter and polyadenylation signal.

Vector pMON505 is a derivative of pMON200 (Rogers, S.G.,
1987) in which the Ti plasmid homology region, LIH, has been replaced with
a 3.8 kb HindIII to SmaI segment of the mini RK2 plasmid, pTJS75
(Schmidhauser & Helinski, 1985). This segment contains the RK2 origin of
replication, oriV, and the origin of transfer, oriT, for conjugation into
Agrobacterium using the tri-parental mating procedure (Horsch & Klee,
1986). Plasmid pMON505 retains all the important features of pMON200,
including the synthetic multi-linker for insertion of desired DNA fragments,
the chimeric NOS/NPTII'/NOS gene for kanamycin resistance in plant
cells, the spectinomycin/streptomycin resistance determinant for selection
in E. coli and A. tumefaciens, an intact nopaline synthase gene for facile
scoring of transformants and inheritance in progeny, and a pBR322 origin of
replication for ease in making large amounts of the vector in E. coli.
Plasmid pMON505 contains a single T-DNA border derived from the right
end of the pTiT37 nopaline-type T-DNA. Southern analyses have shown
that plasmid pMON505 and any DNA that it carries are integrated into the
plant genome, that is, the entire plasmid is the T-DNA that is inserted into
the plant genome. One end of the integrated DNA is located between the
right border sequence and the nopaline synthase gene and the other end is
between the border sequence and the pBR322 sequences.

When adequate numbers of cells (or protoplasts) containing the
linoleic acid desaturase gene are obtained, the cells (or protoplasts) are
regenerated into whole plants. Choice of methodology for the regeneration
step is not critical, with suitable protocols being available for hosts from
Leguminosae (alfalfa, soybean, clover, etc.), Umbelliferae (carrot, celery,
parsnip), Cruciferae (cabbage, radish, rapeseed, etc.), Cucurbitaceae
(melons and cucumber), Gramineae (wheat, rice, corn, etc.), Solanaceae
(potato, tobacco, tomato, peppers) and various floral crops. See, e.g.,
Ammirato (1984); Shimamoto, 1989; Fromm, 1990; Vasil and Vasil, 1990.

Uses of Linoleic Acid Desaturase

The present invention can be used for any modification (either
increase, decrease, or mere change) of the oil content of a plant or plant
tissue. Linolenic acid is an important constituent of several membranes in
plant cells.

One preferred method is to modify the oil content of the plant to
improve the plant's temperature sensitivity. For instance, plants deficient
in linolenic acid display reduced fitness at low temperature (Hugly and
Somerville, 1992). Also, increased linoleic acid content in vegetative tissues
has been implicated as a factor in freezing tolerance in higher plants
(Steponkus et al., 1990 and references therein). In a preferred
embodiment, expression of the linoleic acid desaturase structural coding
sequence can result in the genetic modification of higher plants to achieve
tolerance to low environmental temperatures. Transformation with
pTiDES3 demonstrates that linolenic acid levels can be increased by
expression of this gene in a constitutive manner. Chilling or freezing injury
in crops may be overcome by expression of this gene in vegetative or
reproductive tissues by employing an appropriate promoter.

Linolenic acid, a polyunsaturated fatty acid, is also extensively
used in the paint and varnish industry in view of its rapid oxidation. Flax
seed is a predominant source of this oil. Higher quantities of this fatty acid
in rapeseed or soybean will provide opportunities for using vegetable oils
from these sources as a replacement for linseed (flax) oil. Expression of a
linoleic acid desaturase structural coding sequence in seed tissue can result
in a higher proportion of linolenic acid in the storage oil.

Linolenic acid is further a precursor in the biosynthesis of
jasmonic acid, an important plant growth regulator. Linolenic acid is
converted to jasmonic acid by introduction of an oxygen to the carbon chain
by a lipoxygenase, followed by dehydration, reduction, and several
β-oxidations (Vick and Zimmerman, 1984). The activity of jasmonic acid has
been measured in terms of induction of pathogen defense responses. By
application of free linolenic acid to plants, plant pathogen defenses can also
be induced (Farmer and Ryan, 1992). A model has been proposed to explain
the ability of free linolenic acid to exhibit the effects associated with
jasmonic acid (Farmer and Ryan, 1992). It is hypothesized that all of the
enzymatic activities which are required for the conversion of linolenic acid
to jasmonic acid are constitutively present in the cell and the rate limiting
step in the production of jasmonic acid is the availability of free linolenic
acid. A likely route for the production of the free linolenic acid is by the
activity of a lipase in the plasma membrane.

It further has been observed that exogenous jasmonic acid can
more powerfully activate defense responses than can wounding. This
suggests that wounds cannot generate enough free linolenic acid to support
high level production of jasmonic acid. The activity of the lipase or the
availability of appropriate substrate for the lipase may be rate limiting
upon wounding. By increasing levels of available substrate, that is, by
increasing linolenic acid levels in the plasma membrane, it should be
possible to enhance a plant's ability to respond to pathogens by allowing
for a higher production of jasmonic acid. Expression of a linoleic acid
desaturase structural coding sequence can result in a higher molar percent
linolenic acid in the plasma membrane of a plant cell, therefore enhancing
the jasmonic acid signaling pathway. It is our intent to evaluate plants
containing high linolenic acid levels in root and foliar tissues for their
pathogen resistance.

It is also undesirable to have significant levels of linolenic acid in
cooking oils. Linolenic acid is unstable during cooking and is rapidly oxidized.
The oxidized products impart rancidity to the finished product. Rapeseed or
soybean oil containing less than about 3%, and preferably 2% or less, of
linolenic acid is ideal for use as a cooking oil. By expression of the antisense
of the structural coding sequence for linoleic acid desaturase, it is possible
to reduce the linolenic acid content of these oils.

All higher plants have linolenic acid and, therefore, contain genes
for linoleic acid desaturases. Because of the many examples in which genes
isolated from one plant species have been used to isolate the homologous
genes from other plant species, it is apparent to anyone skilled in the art
that the results presented here do not only pertain to the use of the B.
napus fad3 gene, or to the use of the gene to modify fatty acid composition
in B. napus. Obviously, the linoleic acid desaturases from many organisms
could be used to increase linolenic acid biosynthesis and accumulation in
plants, and enzymes from any other higher plant or algae can serve as
sources for linoleic acid desaturase genes. For example, since a YAC
containing the Arabidopsis gene was used to isolate the B. napus gene, it is
apparent that the insert in pBNDES3 could be used as a probe of genomic
libraries for isolation of the corresponding full length genes from other plant
species. It is also likely that the information contained in the sequence of
this gene will be useful to clone other lipid desaturase genes.

Expression of a linoleic acid desaturase in a sense orientation
may also allow for the isolation of plants with reduced levels of linolenic
acid. This could be accomplished by the mechanism of co-suppression (Bird
and Ray, 1991). The molecular mechanism of co-suppression is at this
time poorly understood but occurs when plants are transformed with a gene
that is identical or highly homologous to an allele found in the plant's
genome. There are several examples where expression of a chimeric gene in
plants can result in a reduction of the gene product from both the chimeric
gene and the endogenous gene(s). Those skilled in the art will recognize that
the resulting decrease in linolenic acid would be a direct result of expression
of the linoleic acid desaturase structural coding sequence and would be
correlated to the linoleic acid desaturase activity in the transformed plant.

Linolenic acid levels in plant cells can also be modified by
isolating genes encoding transcription factors which interact with the
upstream regulatory elements of the plant linoleic acid desaturase gene(s).
Enhanced expression of these transcription factors in plant cells can affect
the expression of the linoleic acid desaturase gene. Under these conditions,
the increased or decreased linolenic acid content would also be caused by a
corresponding increase or decrease in the activity of the linoleic acid
desaturase enzyme, although the mechanism is different. Methods for the
isolation of transcription factors have been described (Katagiri, 1989).

The following examples are provided to better elucidate the
practice of the present invention and should not be interpreted in any way
to limit the scope of the present invention. Those skilled in the art will
recognize that various modifications, truncations, etc. can be made to the
methods and genes described herein while not departing from the spirit and
scope of the present invention.

Expression of fad3 gene to increase linolenic acid

To verify the assumption that the cDNA insert in pBNDES3
encodes a linoleic acid desaturase, both wild type and fad3 mutant
Arabidopsis were transformed to contain the cDNA insert. In order to
express the linoleic acid desaturase structural coding sequence (hereafter
referred to as the "fad3 gene") in plant cells, the plasmid pBNDES3 was
digested with XhoI and the ends were filled in with the Klenow fragment of
DNA polymerase (Sambrook et al., 1989). The cDNA insert was
subsequently excised by digestion with SacI and ligated into the SacI and
SmaI sites of the binary Ti plasmid vector pBI121 (Clontech
Laboratories), thereby replacing the GUS reading frame. The ligation
reaction was carried out in 20 µl for 12 h at 16°C using 100 ng of both
insert and vector, and one unit of T4 DNA ligase. The ligation mixture was
used to transform competent DH5α E. coli cells (prepared by the calcium
chloride method, according to Sambrook et al., 1989), and transformants
were selected on L-broth plates that contained 50 µg/ml kanamycin.
Alkaline minipreparations of recombinant clones were analyzed for the
correct restriction pattern. One of these plasmids, designated pTiDES3,
was used for further experiments.






WO 94/18337    PCT/US94/01321




This plasmid was electroporated (according to Mersereau and
Pazour, 1990) into Agrobacterium tumefaciens strain R1000, which carries
an Ri plasmid. The transformed bacteria were selected on kanamycin LB
plates for 2 days at 30°C. DNA minipreparations of several recombinant
bacteria were performed and analyzed as described above to verify the
presence of the construct.

Young flowering stems of wild type and the fad3 mutant of
Arabidopsis were sterilized for 30 min in 10% commercial bleach, 0.02%
Triton X-100, and 2-cm explants that contained the flowering stem were
infected with R1000 (pTiDES3). This was performed by dipping the
sectioned extremity in a drop of an overnight culture of the appropriate
Agrobacterium that was grown from a single colony in LB medium
supplemented with 50 µg/ml kanamycin.

The infected stems were cultured for two days on solid MSO
medium (Gibco MS salts plus Gamborg B5 vitamins, 3% sucrose and 0.8%
agar). At this time the stem segments were transferred for 5 weeks to
MSO medium containing 200 µg/ml cefotaxime to kill the bacterium. After
approximately two weeks, most of the stem explants had developed rooty
tumors resulting from transfer of parts of the Ri plasmid into cells of the
stem explants. In order to identify the rooty tumors which had also
received the binary Ti plasmid pTiDES3, approximately 24 rooty tumors
from each treatment were transferred to MSO medium containing 50 µg/ml
of kanamycin to select for the growth of those roots which had been
cotransformed with the binary Ti plasmid; the medium also contained 200
µg/ml of cefotaxime to inhibit bacterial growth. Following a further period of
growth for 2 weeks, fatty acid methyl esters were prepared (as described
above) from the roots for analysis by gas chromatography. The results of
these analyses are presented in Table 2.
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Table2. Genotype 

mol% A^dtype fiad3 wildtype fadS 

Fatty acid pBI121 pBI121 pTiDESS pTiDES3 



16:0 


22.0±2.9 


21.2±1.6 


21.1±0.9 


21.3±2.3 


16:1 


2.5±0.7 


1.6±0.8 


2.0±0.1 


1.5±0.2 


18:0 


2.3±1.9 


2.3±1.9 


1.9±0.2 


1.6±0.4 


18:1 


3.8±1.3 


5.9±2.6 


7.7±2.0 


9.1±2.0 


18:2 


37.3±3,7 


62.2±5.9 


15.7±11.7 


24.4±14.9 


18:3 


31.9±4.5 


6.7±0.7 


51.3±10.9 


42.1±15.5 



10 



Table 2 shows the fatty acid composition of transgenic roots.
The transgenic roots resulting from infection of wild type or the fad3
mutant with A. tumefaciens R1000 carrying the vector (pBI121) or the
plasmid pTiDES3 were grown in the presence of kanamycin (50 µg/ml) for
three weeks to identify the roots which had been cotransformed with one of
these plasmids. The fatty acid composition of the roots was determined as
previously described (Browse et al., 1986). The abbreviations used in Table
2 are as follows: 16:0, palmitic acid; 16:1, palmitoleic acid; 18:0, stearic acid;
18:1, oleic acid; 18:2, linoleic acid; 18:3, linolenic acid. The values presented
are the mean ± SD (n=12).

From these results it can be seen that the production of rooty
tumors containing pBI121 on wild type Arabidopsis or the fad3 mutant had
no effect on the fatty acid composition compared with wild type
Arabidopsis or the fad3 mutant not containing pBI121. By contrast,
transformation of the fad3 mutant with the plasmid pTiDES3 resulted in
large increases in the content of linolenic acid. Whereas the linolenic acid
content was 6.7 ± 0.7% in the fad3 mutant transformed with pBI121, the
presence of pTiDES3 resulted in accumulation of 42.1% of the fatty acids
as linolenic acid. The increased content of linolenic acid was accompanied by a
decrease of corresponding magnitude in the content of linoleic acid. Thus, it
is clear that the fad3 gene encodes a linoleic acid desaturase. Introduction
of the fad3 gene into wild type tissues also resulted in significantly
increased accumulation of linolenic acid and a corresponding decrease in
linoleic acid (Table 2). Thus, it is apparent from these results that the
linolenic acid content of plant tissues can be increased by high level
expression of a linoleic acid desaturase. In the present embodiment, the
fad3 gene was placed under transcriptional control of the constitutive high
level CaMV 35S promoter carried on pBI121. The implication from these
results is that expression from this promoter raised the level of expression
of the fad3 gene to levels higher than are normally achieved by expression
from the endogenous fad3 promoter. The results presented here indicate
that the fad3 gene has significant utility in genetic modification of higher
plants to elevate linolenic acid levels.
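The statement that the gain in linolenic acid (18:3) is matched by a loss of linoleic acid (18:2) can be checked against the Table 2 means. The following Python sketch is illustrative only: the numbers are the mean mol% values reported in Table 2, while the dictionary layout and the function name `shift` are assumptions made for this example.

```python
# Mean mol% values for 18:2 and 18:3 copied from Table 2.
# The data layout and all names are illustrative assumptions.
TABLE2_MEANS = {
    ("wild type", "pBI121"): {"18:2": 37.3, "18:3": 31.9},
    ("fad3", "pBI121"): {"18:2": 62.2, "18:3": 6.7},
    ("wild type", "pTiDES3"): {"18:2": 15.7, "18:3": 51.3},
    ("fad3", "pTiDES3"): {"18:2": 24.4, "18:3": 42.1},
}


def shift(genotype):
    """Change in mol% (pTiDES3 minus pBI121 control) for 18:2 and 18:3."""
    control = TABLE2_MEANS[(genotype, "pBI121")]
    transformed = TABLE2_MEANS[(genotype, "pTiDES3")]
    return {fa: round(transformed[fa] - control[fa], 1) for fa in control}


for genotype in ("wild type", "fad3"):
    print(genotype, shift(genotype))
```

On these means, 18:3 rises by roughly 19 mol% (wild type) and 35 mol% (fad3) while 18:2 falls by roughly 22 and 38 mol%, respectively. The standard deviations in Table 2 are large, so the comparison is indicative rather than exact.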

Example 2

Antisense expression of fad3 gene to decrease linolenic acid levels

In order to decrease the linoleic acid desaturase activity by
genetic engineering methodology, the cDNA insert of pBNDES3 was cloned
into plant expression cassettes in an antisense orientation. A 959 bp BglII
restriction fragment of pBNDES3 was used in the antisense expression
vectors. The fragment extends from 152 nucleotides downstream of the
initiating methionine codon of the cDNA to a second BglII restriction site
that is located near the C-terminus of the coding region; 189 nucleotides
of the coding region are excluded from this fragment. Triple ligations were
performed with the fad3 gene fragment to construct two separate plant
expression cassettes.
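Placing a fragment in an antisense orientation means that the transcript read from the promoter is the reverse complement of the corresponding stretch of the sense message. A minimal Python sketch of that relationship follows. The 12-nucleotide sequence is a made-up placeholder and is not taken from the fad3 cDNA; the function name is likewise an assumption.

```python
# Illustrative only: SENSE_FRAGMENT is a hypothetical placeholder sequence,
# not part of the fad3 cDNA described in the text.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an uppercase DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


SENSE_FRAGMENT = "ATGGTTCAAGCT"
antisense = reverse_complement(SENSE_FRAGMENT)
print(antisense)
```

Applying the function twice returns the original sequence, which is a quick sanity check when verifying the orientation of an insert from sequence data.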

A seed specific expression cassette was constructed by insertion
of the BglII fragment of pBNDES3 in an antisense orientation behind the
soybean promoter for the α' subunit of β-conglycinin (7S promoter). A
975 bp HindIII to BglII fragment containing the 7S promoter derived from
pMON529 was prepared by digesting with BglII for 30 min at 37 °C followed
by addition of Calf Intestinal Alkaline Phosphatase (CIAP) (Boehringer
Mannheim). The reaction was allowed to proceed for 20 min followed by
purification of the linearized DNA using the GeneClean (Bio 101)
purification system. The DNA was then digested with HindIII. A fragment
derived from pMON999 containing the nopaline synthase 3' region and the
pUC vector backbone was prepared by digestion with BamHI and
treatment with CIAP. The DNA was purified by the GeneClean procedure
and digested with HindIII. The fragment of pBNDES3 was prepared by
digestion with BglII. The three fragments were purified by agarose gel
electrophoresis and the GeneClean procedure. 50 to 200 ng of the purified
fragments were ligated for one hour at room temperature followed by
transformation into the E. coli strain JM101. Resulting transformant
colonies were used for plasmid preparation and restriction digestion
analysis. Double digestion with BglII and NcoI was used to screen for
transformants containing the fad3 gene in an antisense orientation. One
clone was designated as correct and named pMON13801.

A second expression cassette was constructed to allow for
constitutive expression of the antisense message in plants. A fragment
containing the enhanced 35S promoter was prepared from pMON999 by
restriction digestion with HindIII and BglII followed by treatment with
CIAP as above. The correct sized fragment was obtained by agarose gel
electrophoresis and the GeneClean procedure. The BglII to HindIII vector
fragment and the BglII fragment of pBNDES3 which were purified above
were used in this construction. Ligation, transformation and screening of
clones were as described above. One clone was designated as correct and
named pMON13802.

In both pMON13801 and pMON13802, the promoter, fad3 gene
and the Nos 3' region can be isolated on a NotI restriction fragment. These
fragments can then be inserted into a unique NotI site of the vector
pMON17227 to construct glyphosate selectable plant transformation
vectors. The vector DNA is prepared by digestion with NotI followed by
treatment with CIAP. The fad3 containing fragments are prepared by
digestion with NotI, agarose gel electrophoresis and purification with
GeneClean. Ligations are performed with approximately 100 ng of vector
and 200 ng of insert DNA for 1.5 hours at room temperature. Following
transformation into the E. coli strain LE392, transformants were screened
by restriction digestion to identify clones containing the fad3 expression
cassettes. Clones in which transcription from the fad3 cassette is in the
same direction as transcription from the selectable marker were designated
as correct and named pMON13804 (FMV/CP4/E9, 7S/anti fad3/NOS)
(Figure 8) and pMON13805 (FMV/CP4/E9, E35S/anti fad3/NOS) (Figure
9).

In preparation for transforming canola cells, pMON13804 and
pMON13805 were mated into Agrobacterium ABI by a triparental mating
with the helper plasmid pRK2013.

Seeds from the plants produced by transformation were
analyzed for alterations in fatty acid profile. Fatty acid methyl esters
(FAMES) were prepared from seed tissue and analyzed by capillary gas
chromatography (Browse et al., 1986). For initial screening of plants, six
seeds were pooled together from an individual plant. The seeds were
crushed and FAMES extracts were made. Control plants, plants
transformed with the selectable marker only (pMON17227), were also
analyzed using the identical procedure. From the initial screen on pooled
seed samples, several lines were identified which displayed a decreased level
of linolenic acid. Lines with decreased levels of linolenic acid were
reanalyzed by determining fatty acid profiles from individual seeds. Four to
twenty individual seeds were analyzed from candidate lines and from
selected control plants. The results of the FAMES analysis are summarized
in Figure 9.

Figure 9 shows the levels of fatty acids expressed in molar
percent of twenty individual seeds of the transgenic line 13804-51 as
compared to control seed. Panel A discloses oleic acid, panel B discloses
linoleic acid and panel C discloses linolenic acid.

The data in Figure 9 demonstrate that antisense expression of a
linoleic acid desaturase has significantly altered the fatty acid profile of the
resulting seed tissue. The percent of linolenic acid has been reduced to a
little over 2% of the total fatty acid in the seed tissue. The percent of
linoleic acid has been reduced slightly and, surprisingly, the percent of oleic
acid in the seed has been increased to approximately 70%. This
demonstrates the applicability of utilizing the fad3 gene to manipulate the
fatty acid profile of crop plants.

In order to demonstrate that the alteration in the fatty acid
profile of the FAMES extracted from total seed tissue would be reflected in
the seed oil fraction, triglycerides from seeds of fad3 antisense plants were
characterized. Total lipid extracts were made by pooling ten seeds and
grinding in 2 ml of methanol:chloroform:water (4:2:1). The homogenate was
allowed to stand for 20 min and then debris was pelleted and discarded. To
the supernatant 400 µl of chloroform:methanol (2:1), 640 µl of chloroform
and 740 µl of water were added and the mixture was vortexed. Phases were
separated by centrifugation and the chloroform phase was recovered and
dried under nitrogen. Samples were resuspended in 100 µl of chloroform and
10 µl was applied to silica gel G thin layer chromatography plates for
separation. Two identical plates were prepared with one being charred after
development to allow for alignment and location of spots to be analyzed on
the other plate. Plates were developed three times in petroleum
ether:diethyl ether:acetic acid (90:10:1). One plate was sprayed with 50%
sulfuric acid and heated in an oven at 90 °C to allow for detection of lipids.
Triglyceride fractions were identified as comigrating on the plate with
purchased lipid standards (Sigma Chemical Co, cat #178-13). The charred
plate was aligned with the identical plate and the triglyceride fractions were
scraped from the plate. The fatty acids were transesterified to produce
FAMES extracts for GC analysis by the same procedure as above. The
fatty acid profiles of the triglyceride fractions are shown in Table 3 and
demonstrate that this fraction has decreased linolenic acid.

TABLE 3

Transgenic            Mol%
line          18:1    18:2    18:3

17227-10      44      30      15.3
17227-493     65      17      6.9
13804-47      58      21      4.3
13804-50      67      20      2.8
13804-76      59      19      5.0
13804-117     62      21      4.0



Table 3 compares the fatty acid molar percentages of
triglyceride fractions from control and transgenic lines. These
results provide clear evidence that the fad3 gene can be used to decrease
the levels of linolenic acid in the storage oil of plants. The gene provides a
tool for the manipulation of the fatty acid profile of seed storage oil to
improve the products derived from the oil.
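The comparison in Table 3 can be summarized by averaging the 18:3 values per construct. The Python sketch below uses the Table 3 numbers; grouping lines by the numeric prefix of the line name (17227 for lines carrying only the selectable marker vector, 13804 for lines carrying the 7S antisense cassette) is an assumption made for illustration, as are all function and variable names.

```python
# 18:3 mol% of the triglyceride fraction, copied from Table 3.
# Grouping by the prefix of the line name is an illustrative assumption.
TABLE3_18_3 = {
    "17227-10": 15.3,
    "17227-493": 6.9,
    "13804-47": 4.3,
    "13804-50": 2.8,
    "13804-76": 5.0,
    "13804-117": 4.0,
}


def mean_by_construct(values):
    """Average the mol% values of all lines sharing a construct prefix."""
    groups = {}
    for line, mol_percent in values.items():
        construct = line.split("-")[0]
        groups.setdefault(construct, []).append(mol_percent)
    return {c: round(sum(v) / len(v), 1) for c, v in groups.items()}


print(mean_by_construct(TABLE3_18_3))
```

With only two control lines and four antisense lines, the averages are a rough summary and carry no statistical weight on their own.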

A surprising result of this Example 2 is the effect the antisense
fad3 gene has on the oleic acid content. The precise mechanism by which
antisense expression of a gene exerts an effect on the activity of an
endogenous gene is unclear but is obviously a function of the homology of
the sense and antisense gene products. Based upon the above
experimental result, it would not be unreasonable to predict that the
portion of the fad3 gene antisense message used contained a certain degree
of homology with the genes providing the activity of one or more oleate
desaturases. Therefore, a further advantage of the above invention is that
expression of a linoleic acid desaturase antisense message may also exert
an effect on oleate desaturase activity.

The unexpected nature of the reduction in oleic acid desaturase
activity from the antisense fad3 plants is most apparent when one
compares the fatty acid profiles from the antisense plants and the fad3
mutant of Arabidopsis. The levels of linoleic acid in the fad3 mutant plants
increased when linoleic acid desaturase activity was eliminated by
mutation. This indicates that the activity of the oleate desaturase was not
strongly affected by the loss of linoleic acid desaturase activity or by the
accumulation of linoleic acid. In the fad3 mutant of Arabidopsis the level of
linoleic acid increased when the level of linolenic acid decreased. However, a
different pattern occurred in the antisense fad3 plants. In plants which
exhibit a decreased percent of linolenic acid there is no corresponding
increase, and often a decrease, in the percent of linoleic acid. There is an
increase in the percent of oleate in the antisense fad3 plants. This would
indicate that oleate desaturase activity is depressed in these plants. The
effects on the fatty acid profile of the fad3 mutation and the fad3 antisense
expression are not equivalent, indicating that antisense expression of a
linoleic acid desaturase can depress an oleate desaturase activity in plants.

Example 3

Modification of linolenic acid levels in soybean

The isolation of the fad3 gene from B. napus provides a tool to
those with ordinary skill in the art to isolate the corresponding gene or
cDNA from other plant species. There are many examples in which genes
from one plant species have been used to isolate the homologous genes
from another plant species. One such plant which could be improved
by the modification of the level of linolenic acid is soybean.

Soybean oil typically contains linolenic acid at a level of 7-9% of
the fatty acid in the oil. This level is undesirable because it promotes
instability upon heating and imparts rancidity to the finished product. The
levels of linolenic acid can be lowered by the expression of the soybean fad3
gene or cDNA in an antisense orientation in the developing seed. The
following example describes one method for the isolation of a fad3 cDNA
from soybean. However, similar procedures could be followed to isolate a
genomic clone which could also be used to decrease the level of linoleic acid
desaturase activity by antisense expression of a portion or all of the gene.

The fad3 gene from B. napus is used as a probe to screen a cDNA
library constructed from soybean mRNA. In order to isolate a cDNA to be
used in decreasing linolenic acid in seed, the optimal tissue to use for the
isolation of mRNA is developing seed. There is, however, flexibility in the
choice of methods and vectors which can be used in the construction and
analysis of cDNA libraries (Sambrook et al., 1989). Procedures for the
construction of cDNA libraries are available from manufacturers of cloning
materials or from laboratory handbooks such as Sambrook et al., 1989.
Once a suitable cDNA library has been constructed from soybean, all or a
portion of the fad3 cDNA from B. napus is labeled and used as a probe of the
library. DNA fragments can be labeled for radioactive or non-radioactive
screening procedures. The library is screened under suitable stringency.
Conditions are dependent upon the degree of homology between the fad3
gene of B. napus and soybean. Probe positive clones are plaque purified by
standard procedures and characterized by restriction enzyme mapping and
DNA sequence analysis. Clones are concluded to be soybean fad3 based
upon data obtained from the sequence analysis or by expression in plants.

The entire clone or a portion thereof is placed downstream of a
promoter sequence in an antisense orientation. Suitable promoters include
seed specific promoters, such as the 7S (β-conglycinin) α'-subunit
promoter, or less tissue specific promoters, such as the CaMV 35S
promoter. An appropriate 3' non-translated region is placed downstream of
the antisense cDNA to allow for transcription termination and for the
addition of polyadenylated nucleotides to the 3' end of the RNA sequence.
This expression cassette is then combined with a selectable or scorable
marker gene and soybean cells are transformed by free DNA delivery
(Christou et al., 1990) or an Agrobacterium based method of plant
transformation (Hinchee et al., 1988). Plants recovered are allowed to set
seed and mature seed are used for the production of FAMES by the
procedures outlined above. The FAMES extracts are analyzed by gas
chromatography to identify plant lines with reduced levels of linolenic acid
in the seed.

Alternatives to the above methods may include but are not
limited to the use of degenerate oligonucleotides as probes to screen the
library. Degenerate oligonucleotide probes would best be
designed by choosing short segments of the fad3 amino acid sequence where
the degeneracy of the genetic code is limited or by choosing sequences which
appear to be highly conserved between the fad3 gene of B. napus and other
known linoleic acid desaturases, such as the desaturase from the
cyanobacterium Synechocystis. The oligonucleotides could be labeled and
used to probe a soybean cDNA library. Alternatively, degenerate
oligonucleotides could be used as primers for the isolation of a portion or all
of the soybean cDNA by PCR amplification.

Similar procedures could be used to isolate the homologous genes
from other plant species. Another preferred plant species which could be
improved by the modification of the level of linolenic acid is flax. Flax
oil typically contains linolenic acid at a level of 45-65% of the fatty acid in
the oil. This level is undesirable because it promotes instability upon
heating and imparts rancidity to the finished product.
Example 4

Sense expression of fad3 to obtain reduced levels of linolenic acid

The cloning of the fad3 gene also provides a tool to decrease the
levels of linolenic acid via the mechanism of co-suppression.
Co-suppression occurs when plants are transformed with a
gene that is identical or highly homologous to an allele found in the plant's
genome (Bird and Ray, 1991). There are several examples where
expression of a chimeric gene in plants can result in a reduction of the gene
product from both the chimeric gene and the endogenous gene(s). Therefore
the fad3 gene product of B. napus may be reduced by transformation of B.
napus with all or a portion of the fad3 cDNA which has been isolated. The
resulting plant has reduced linoleic acid desaturase activity in tissues
where the chimeric gene is expressed. The phenotype of reducing the
linoleic acid desaturase activity is a reduction in the levels of linolenic acid.
The mechanism of co-suppression could be applied to any plant species
from which the fad3 gene is cloned, provided the plant species is transformed
with fad3 in a sense orientation.

In order to reduce levels of linolenic acid by the mechanism of
co-suppression, a plant transformation construct is assembled with the fad3
gene or cDNA in a sense orientation. The entire clone or a portion thereof is
placed downstream of a promoter sequence in a sense orientation. Suitable
promoters include seed specific promoters, such as the 7S (β-conglycinin)
α'-subunit promoter, or less tissue specific promoters, such as the CaMV
35S promoter. An appropriate 3' non-translated region is placed
downstream of the fad3 gene to allow for transcription termination and for
the addition of polyadenylated nucleotides to the 3' end of the RNA
sequence. This expression cassette is then combined with a selectable
marker gene and B. napus cells are transformed by an Agrobacterium
based method of plant transformation. Plants recovered are allowed to set
seed and mature seed are used for the production of FAMES which are
analyzed by gas chromatography to identify plant lines with reduced levels
of linolenic acid in the seed.

Example 5

Isolation of a chloroplast delta 15 desaturase from Arabidopsis

A fragment of 959 bp was excised from the fad3 cDNA insert
using the restriction endonuclease BglII, and labeled radioactively according
to Feinberg and Vogelstein (1983). This fragment was used to probe a
cDNA library from Arabidopsis thaliana as described above (Example 1)
except that the hybridization temperature was 52 °C. Several cDNA
clones were positive, and one of them (pVAl) was further characterized.
Its deduced amino acid sequence exhibited a strong homology with fad3
except at the N-terminus. The cDNA insert was placed under the control of
the 35S promoter in the Ti vector pBI121, and the resulting construct,
pBIVA12, was electroporated into Agrobacterium (C58 pGV3101). The
bacterium was used to transform the Arabidopsis mutant fadD. For
transformation, plants were grown at 22 °C with a light intensity of
100 µE/cm-2, until bolting (approximately 2 and 1/2 weeks). The stems
(1 mm-10 mm long) were removed and the plants were inoculated with a
drop of an overnight culture of the bacterium. The same operation was
repeated 7 days afterwards.

The plants were then allowed to set seeds. The seeds were
plated (2500 seeds per 150 mm petri dish) on MSO plates that contained
50 µg/ml kanamycin to select for plants that had integrated the construct.
One transformant plant was obtained, and the fatty acids from its leaves
were analyzed by gas chromatography (Table 4). The results obtained
show that the pBIVA12 construct is able to reestablish the levels of
linolenic and hexadecatrienoic acids in the fadD mutant at a level equal to
or superior to the wild type. This demonstrates that pVA12 encodes the
fadD gene.

TABLE 4

fatty acid    fadD     WT       fadD
                                pBIVA12

16:0          13.0     14.0     14.9
16:1          4.9      4.3      4.2
16:2          8.7      0.5      0.3
16:3          3.0      13.2     9.5
18:1          3.3      2.3      1.2
18:2          36.4     10.9     5.8
18:3          30.8     54.6     63.7



Table 4 shows the complementation of the fadD mutant.
Fatty acids were extracted from leaves of Arabidopsis according to Browse
et al. (1986) and were quantified (mol%) by gas chromatography. WT
stands for the Columbia wild type.

Example 6

Isolation of a second chloroplast delta 15 desaturase from Arabidopsis

A fragment of 959 bp was excised from the cDNA insert using
the restriction endonuclease BglII, and labelled radioactively according to
Feinberg and Vogelstein (1983). This fragment was used to probe a cDNA
library from Arabidopsis, exactly as described above (Example 5). Among
the several positive clones obtained, the cDNA pVA34 was further
characterized. Its deduced amino acid sequence exhibited 71.8% and 79.5%
homology with fad3 and fadD, respectively. The N-terminus resembled a
chloroplast transit peptide, meaning that this protein is likely to be
localized to the chloroplast. The strong homology with fad3 and fadD
suggests that the protein is also a delta 15 desaturase. Aside from fad3
and fadD, the only locus known to control delta 15 desaturation is the fadE
locus, which controls a temperature-induced delta 15 desaturase.
Therefore, it is likely that the cDNA contained within the clone pVA34
corresponds to the fadE locus.

Example 7

Linoleic desaturase homology to plant oleic desaturases

The linoleic desaturase genes are the first plant desaturases
isolated whose proteins enzymatically perform the desaturation of an
unsaturated fatty acid precursor. The reaction that linoleic desaturase
performs and the cofactors it uses are likely to be very similar to those of
the oleic desaturase reaction. Given the similar reactions, similar substrates
and probably similar cofactors, it is likely that the oleic desaturase genes and
proteins have homology to the linoleic desaturase genes and proteins. That
the genes share homology is supported by the finding that antisense
expression of the linoleic acid desaturase message results in higher oleic
acid levels, which experimentally indicates homology between the linoleic
and oleic desaturases. These factors indicate that the linoleic desaturase
protein and nucleic acid sequences provide useful information for isolating
other lipid desaturase genes, particularly oleic desaturase genes.

a. Identification of unknown cDNA sequences in databases.

Random cDNA sequencing generates a large number of
sequenced clones but provides no information about the function of the
encoded proteins. Homology to known proteins is the quickest method for
identifying the protein function encoded in the sequenced cDNA. However,
homology searches are informative only when homology with a previously
characterized protein is found. A cDNA sequence that is not homologous
to any known protein remains in the unknown function category. Thus the
results functionally identifying the linoleic desaturases by sequence and by
their ability to complement mutations in plant desaturase genes now
provide a method for identifying the function and identity of random cDNA
clones by their homology to the linoleic desaturases. Additionally, oleic
desaturases are identified by their homology with linoleic desaturases.

A TFASTA search of the GenBank and EMBL public
databases for genes encoding proteins homologous to the protein sequence of the
linoleic desaturase fad3 has identified both linoleic desaturases and a
second class of plant lipid desaturases likely to be oleic desaturases. In
particular, sequences found in GenBank and EMBL and identified as
T04093 and T12950 show significant homology to linoleic desaturases but
show less homology than other linoleic desaturases. These sequences have
30% homology to fad3 and 56% similarity to fad3 linoleic desaturase
(TABLE 5). The full length clone of these cDNAs is obtained by standard
methods and is inserted into plant gene expression and transformation
vectors and transformed into fad2 Arabidopsis mutants to confirm the
identity of the oleic desaturase by genetic complementation as was described
in the example with linoleic desaturase.
TABLE 5

Comparison of Fad3 and T04093 Protein Sequences

Percent Similarity: 52.381%     Percent Identity: 30.476%

fad3    101 GHGSFSDIPLLNSVVGHILHSFILVPYHGWRISHRTHHQNHGHVENDESW 150
T04093    1                 LIFHSFLLVPYFSWKYSHRRHHSNTGSLERDEVF  34

fad3    151 VPLPEKLYKNLP.....HSTRMLRYTVPLPMLAYPIYLWYRSPGKEGSHF 195
T04093   35 VPKQKSAIKWYGKYLNNPLGRIMMLTVQF.VLGWPLYLAFNVSGR...PY  80

fad3    196 NPYSSLFAPSERKLIATSTTCWSIMLATLVYLSFLVDPVTVLKVYGVPYI 245
T04093   81 DGFACHFFPNAPIYNDRERSRYTSLMRVF* 110
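The percent identity reported in Table 5 is the share of compared alignment columns in which both sequences carry the same residue. The Python sketch below shows that calculation for the first alignment block only. Pairing T04093 residues 1-34 with fad3 residues 117-150 without gaps is an assumption recovered from the residue numbering in the table, and the function name and gap symbol are likewise illustrative. The result for this one block is not expected to equal the whole-alignment figure.

```python
def percent_identity(seq_a, seq_b, gap="."):
    """Percent of aligned, ungapped columns in which both residues match."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared


# First block of Table 5; the ungapped pairing is an assumption (see above).
fad3_block = "HILHSFILVPYHGWRISHRTHHQNHGHVENDESW"
t04093_block = "LIFHSFLLVPYFSWKYSHRRHHSNTGSLERDEVF"
print(round(percent_identity(fad3_block, t04093_block), 1))
```

As a plausibility check on the table header, 30.476% and 52.381% are numerically what 32 and 55 matching positions out of 105 compared positions would give, although the source does not state the underlying counts.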



b. Isolation of an oleic desaturase cDNA.

The protein sequence of plant linoleic desaturases can be used
to isolate oleic desaturases. The conserved regions between the linoleic
desaturases and the DesA oleic desaturase are functionally important and
are conserved in the plant oleic desaturase proteins as well. These
conserved amino acid sequences provide a method of isolating plant oleic
desaturases. There are several regions of the linoleic desaturase fad3 that
are conserved in fadD, fadE and DesA. The consensus amino acid sequence
is shown in Table 6, with the amino acids identical in all four proteins shown
in capital letters. As described below, oligonucleotides designed to encode
the amino acid sequences in the conserved regions are used to identify and
isolate plant oleic desaturases.
TABLE 6

Fad3 Protein Sequence and Peptide Targets

fad3       MVVAMDQRSNVNGDSGARKEEGFDPSAQPPFKIGDIRAAIPKHCWVKSPLRSMSYVTRD
consensus   V.cplttp...spseed..erfdpgapppf.laDIraaiPKhCwvKnpwksmsyVvrd
targets    (1a) DIraaiP   (1b) aiPKhC   (1c) KhCwvK

fad3       IFAVAALAMAAVYFDSWFLWPLYWVAQGTLFWAIFVLGHDCGHGSFSDIPLLNSVVGHIL
consensus  va.vfalaa.aayfnnW.lwPlyW.aqGTmfwalFVlGHDCGHgSFsndp.lNsvvGH.l
conserved                  WflwPlyWvaqGT     FVlGHDCGHgSF
targets    (2a) WflwPlyW   (2b) WflwP   (2c) wPlyW   (2d) WvaqGT
           (3a) FVlGHD   (3b) VlGHDC   (3c) GHDCGH   (3d) CGHgSF

fad3       HSFILVPYHGWRISHRTHHQNHGHVENDESWVPLPEKLYKNLPHSTRMLRYTVPLPMLAY
consensus  hssilvPyHgWRisHrtHHqnhghvEnDesWhPl.ekiyknlpk.trmfrftlpipmlay
conserved        PyHgWRisHrtHH      EnDesWvP
targets    (4a) PyHgW   (4b) HgWRisH   (4c) WRisHrtHH   (4d) WRisH   (4e) HrtHH
           (5a) EnDesW   (5b) DesWvP

fad3       PIYLWYRSPGKEGSHFNPYSSLFAPSERKLIATSTTCWSIMLAT.LVYLSFLVDPVTVLK
consensus  pfylw.rspgk.gShyhDds.lF.pkerkdvltScacwcamaAl.lvcLnft.gpiqmlK

fad3       VYGVPYIIFVMWLDAVTYLHHHGHDEKLPWYRGKEWSYLRGGL.TTIDRDYG.IFNNIH
consensus  lygiPyvifvmWldfvTylHHhghedkipwyrgkeWSylrggL.tTidrDYg.winnih
conserved             WldavTylHH              WSylrggL.tTidrDY
targets    (6a) WldavT   (6b) TylHH
           (7a) WSylrggL   (7b) L.tTidrD   (7c) TidrDY

fad3       HDIGTHVIHHLFPQIPHYHLVDATRAAKHVLGRYYREPKTSGAIPIHLVESLVASIK
consensus  HDIgtHviHHLfpqIPhYhLveAteaaKpvlGkyyrEpk.sgplplhLlesl.ksik
conserved  HDIgtHviHHLfpqIPhY
targets    (8a) HDIgtH   (8b) HviHHL   (8c) HHLfpqI   (8d) HLfpqIP   (8e) LfpqIPhY

fad3       KDHYVSDTGDIVFYETDPDLYVYASDKSKIN*
consensus  .dhyvsdtGdvvyYeadp.lyg..s*
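The capital-letter convention used for the consensus line (a residue is capitalized when it is identical in all compared proteins) can be expressed as a short routine. The Python sketch below is a hypothetical illustration, not the procedure used to produce Table 6. In particular, printing a lowercase letter for a strict-majority residue and a dot otherwise is an assumption, since the source defines only the meaning of capital letters, and the four toy sequences are invented for the example.

```python
from collections import Counter


def consensus(aligned, gap="."):
    """Uppercase where all sequences agree, lowercase for a strict-majority
    residue, and the gap symbol where no residue reaches a majority."""
    if len({len(s) for s in aligned}) != 1:
        raise ValueError("aligned sequences must have equal length")
    out = []
    for column in zip(*aligned):
        residue, count = Counter(column).most_common(1)[0]
        if residue == gap:
            out.append(gap)
        elif count == len(aligned):
            out.append(residue.upper())
        elif count > len(aligned) / 2:
            out.append(residue.lower())
        else:
            out.append(gap)
    return "".join(out)


# Hypothetical four-sequence alignment; these are not the Table 6 proteins.
toy = ["GHDCGHGSF", "GHDCGHQSF", "GHDCGHGAF", "GHDCGHGSY"]
print(consensus(toy))
```

Runs of capital letters in such a consensus mark the stretches that are the most attractive targets for oligonucleotide design.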



c. Isolation of the fadC (fad6) Gene from Arabidopsis thaliana

The fadC gene (also referred to as fad6) encodes a
chloroplastic omega-6 desaturase.

The deduced amino acid sequences of the fad3 gene from
Brassica napus and the fadD and fadE genes from Arabidopsis thaliana
were compared with the DesA gene from Synechocystis (Nature, 347:200,
1990). The sequence GHDCGH was determined to represent the most
highly conserved region of these proteins. Consequently, a degenerate
oligomer was designed that contains all the possible codons for the
sequence GHDCGH. This oligomer has the following sequence:

GGNCAYGAYTGYGGNCA.
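The degenerate oligomer can be derived mechanically from the peptide using IUPAC ambiguity codes (N = any base, Y = C or T). The Python sketch below back-translates GHDCGH and counts how many distinct sequences the pool contains. Dropping the final wobble base of the last codon is an assumption made here to reproduce the 17-nucleotide sequence given above; the function names are illustrative.

```python
# IUPAC degenerate codons for the four residues that occur in GHDCGH.
DEGENERATE_CODON = {"G": "GGN", "H": "CAY", "D": "GAY", "C": "TGY"}
# Number of concrete bases represented by each symbol used here.
IUPAC_COUNT = {"A": 1, "C": 1, "G": 1, "T": 1, "Y": 2, "N": 4}


def degenerate_oligo(peptide, drop_last_base=True):
    """Back-translate a peptide into a degenerate oligonucleotide.

    drop_last_base omits the final wobble position (an assumption that
    reproduces the 17-mer quoted in the text).
    """
    oligo = "".join(DEGENERATE_CODON[aa] for aa in peptide)
    return oligo[:-1] if drop_last_base else oligo


def degeneracy(oligo):
    """Number of distinct sequences represented by a degenerate oligo."""
    total = 1
    for base in oligo:
        total *= IUPAC_COUNT[base]
    return total


oligo = degenerate_oligo("GHDCGH")
print(oligo, degeneracy(oligo))
```

A pool of this size is one reason for hybridizing in a solution that evens out melting-temperature differences among the member oligomers.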

An Arabidopsis thaliana cDNA phage library obtained from 
the laboratory of Dr. Ron Davis (PNAS, 88: 1731-1735) was used to screen 
for desaturase genes. This library was made using material from all above 
ground plant parts. 

Approximately 120,000 phage from the library were plated
onto three plates and Hybond-N+ was then used to prepare three filters from
each plate (Molecular Cloning - A Laboratory Manual, 2nd Edition. Eds. J.
Sambrook, E. F. Fritsch, and T. Maniatis, Cold Spring Harbor Laboratory
Press, Cold Spring Harbor, New York 1989, hereafter "Sambrook"). Two
filters from each plate were probed using the degenerate consensus
oligomer which had been end-labelled with (32)P using T4 polynucleotide
kinase. The hybridizations were performed in a solution that contained high
amounts of tetramethylammonium chloride in order to minimize differences
in the melting temperatures of the oligomers that together comprise the
degenerate consensus oligomer. The hybridization solution had the
following composition: 3 M tetramethylammonium chloride, 10 mM sodium
phosphate pH 6.8, 1.25 mM EDTA, 0.5% SDS, 0.5% milk. Hybridization
was carried out overnight at a temperature of 44 °C. Filters were then
washed four times, 20 minutes each time, with 6 x SSC + 0.15% SDS at
room temperature. Filters were then washed one time, for 30 minutes, with
4 x SSC + 0.1% SDS at room temperature. The filters were then exposed to
film for two days.

The third set of filters that were made from each
phage-containing plate were probed using DNA sequences from the three
Arabidopsis desaturase genes that had already been identified: fad3, fadD
and fadE. The fad3, fadD and fadE genes were labelled with (32)P and
hybridized to the third set of phage filters in the following hybridization
solution: 0.2 M NaCl, 20 mM sodium phosphate pH 7.7, 2 mM EDTA, 1%
SDS, 0.5% milk, 10% dextran sulfate, 0.1% sodium pyrophosphate.
Hybridization was carried out overnight at 65°C. Filters were washed four
times, 30 minutes per time, in 2 x SSC + 0.15% SDS at room temperature
and then for 45 minutes with 1 x SSC + 0.1% SDS at 65°C. The filters
were then exposed to film for approximately two hours.

The two sets of filters that were probed with the degenerate
consensus oligomer showed about 60 positive phage per plate (or about 180
total positive phage). Results from the third set of filters that were probed
with the fad3, fadD and fadE genes indicated that only a small percentage
of the phage that hybridized to the consensus oligomer contained the
fad3, fadD or fadE genes.

Seventy-six of the phage that hybridized to the consensus
oligomer, but not to the fad3, fadD or fadE genes, were plaque purified. The
purified phage were then spotted onto bacteria growing on solid media on
plates and allowed to form plaques. Several duplicate filters were then
made of these plates (Sambrook). One of these filters was probed with the
consensus oligomer, as described above. A second filter was probed with a
mixture of the Arabidopsis thaliana fad3, fadD and fadE genes, as
described above.

In order to determine which of the 76 phage contained the
same cDNA inserts as which other phage, some of the filters were probed
with cDNA inserts from some of the phage. In order to perform this
experiment, the cDNA inserts from most of the phage were isolated by
the polymerase chain reaction (PCR), using oligomers that bound to DNA
flanking the cDNA cloning site in the phage vector. These cDNA sequences
were labelled with (32)P (random hexamer labelling) and hybridized to the
filters using the following hybridization solution: 30% formamide, 0.2 M
NaCl, 20 mM sodium phosphate pH 7.7, 2 mM EDTA, 1% SDS, 0.5% milk,
0.1% sodium pyrophosphate. The hybridizations were carried out for 14
hours at 65°C. The filters were washed four times, 15 minutes per wash,
with 2 x SSC + 0.15% SDS at room temperature and were then exposed to
film.

The combination of the high formamide concentration in the
hybridization solution and the high hybridization temperature meant that
only DNA sequences that were virtually identical would hybridize, allowing
us to distinguish between nearly identical sequences. Several rounds of
hybridizations using cDNA inserts from different phage were carried out
until it had been determined which phage contained the same, or at least
extremely similar, cDNA inserts. On the basis of these experiments, we
determined that all of the 76 phage contained one of four cDNA inserts.
Sequence data was obtained from each of these four cDNAs. None of these
cDNAs was found to be homologous to known desaturase genes, and so we
concluded that none of these four cDNAs is likely to encode a desaturase.

Since the number of phage that hybridized to the consensus
oligomer was quite high (about 180 phage hybridized in the initial screen
described above), we were not able to analyze all of the positive phage in
the initial experiments. So, an attempt was made to identify phage that
hybridized to the consensus oligomer but that did not contain the fad3, fadD
or fadE genes or one of the four non-desaturase encoding clones that were
identified in the first screen. In order to do this, between 500,000 and
1,000,000 phage from the library described above were plated onto 10
plates. Three filters were made from each plate (Sambrook). Two of these
three sets of filters were then hybridized with (32)P labelled consensus
oligomer as described above except that hybridization was carried out at
42°C instead of at 44°C. The third set of filters were hybridized with (32)P
labelled DNA from the Arabidopsis fad3, fadD and fadE genes together with
DNA from each of the four cDNAs identified in the first round of screening
as hybridizing to the consensus oligomer but not encoding desaturases.
This third set of filters were hybridized in: 30% formamide, 0.2 M NaCl,
20 mM sodium phosphate pH 7.7, 2 mM EDTA, 1% SDS, 0.5% milk, 0.1%
sodium pyrophosphate at 65°C. All three sets of filters were hybridized for
12 hours and then washed several times with 2 x SSC + 0.15% SDS at
room temperature. The filters were then exposed to film.

Approximately 200 phage from each plate hybridized to the
consensus oligomer. 50-60% of these phage also hybridized to fad3, fadD,
fadE or to one of the four clones identified in the first screen. About 58
phage that hybridized to the consensus oligomer, but not to fad3, fadD,
fadE or one of the four previously identified clones, were plaque purified.
The purified phage were then spotted onto a bacterial lawn growing on solid
media on a petri plate and the phage were allowed to form plaques. Several
filters were prepared from these plates and hybridized with (32)P labelled
cDNA inserts from various of the newly purified phage, as described above.
In this manner, all of the phage identified in this second round of screening
were found to contain one of eight different cDNA inserts.

Sequence data was obtained from each of the eight cDNAs.
One of the cDNAs, which was contained within only one of the phage, was
found to have some sequence similarity to a known desaturase gene from
cyanobacteria, the DesA gene. Further sequence information was obtained
for this clone. This additional sequence showed very significant sequence
similarity to the DesA gene, confirming that the clone contained a
desaturase gene. The remainder of the cDNA contained within the clone
was sequenced and compared with the sequences of other known
desaturases. The new desaturase was 53.0% identical to DesA at the
nucleotide level and 43.9%, 45.6% and 47.0% identical to B. napus fad3,
Arabidopsis fadD and Arabidopsis fadE, respectively. As the gene
contained within the clone was significantly more similar in sequence to the
DesA gene (which is a delta-12 desaturase) than to fad3, fadD or fadE
(which are omega-3 desaturases), the new desaturase was expected to be a
delta-12 (= omega-6) desaturase.

The additional sequence data also indicated that this new
desaturase gene contains a region that has only a one base pair mismatch
to the desaturase consensus sequence described above. This mismatch
means that the new desaturase has the sequence GHDCAH instead of
GHDCGH.
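
The effect of such a single-base difference on probe matching can be
illustrated with the following Python sketch, which is offered only as
an aid to the reader. The function name count_mismatches and the two
target sequences are hypothetical: the third position of each codon in
the targets was chosen arbitrarily, since the actual nucleotides of the
clone at this site are not reproduced here.

```python
# Standard IUPAC nucleotide codes mapped to the bases they stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "M": "AC", "K": "GT", "S": "CG", "W": "AT",
    "H": "ACT", "B": "CGT", "V": "ACG", "D": "AGT", "N": "ACGT",
}


def count_mismatches(probe, target):
    """Count positions where the target base is not allowed by the probe."""
    if len(probe) != len(target):
        raise ValueError("probe and target must have the same length")
    return sum(1 for p, t in zip(probe, target) if t not in IUPAC[p])


consensus = "GGNCAYGAYTGYGGNCA"  # degenerate oligomer for GHDCGH

# Hypothetical coding sequences (arbitrary codon choices, for illustration).
site_gly = "GGTCATGATTGTGGTCA"   # G H D C G H: Gly codon GGN at the fifth codon
site_ala = "GGTCATGATTGTGCTCA"   # G H D C A H: Ala codon GCN at the fifth codon

print(count_mismatches(consensus, site_gly))  # 0
print(count_mismatches(consensus, site_ala))  # 1
```

Because glycine codons are GGN and alanine codons are GCN, the change
from G to A in the peptide costs the probe exactly one mismatched base.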

A clone containing a full length cDNA for this gene was
isolated and completely sequenced. This full length cDNA was sub-cloned
into the plant transformation vector pBI121 such that the gene is
transcribed under the control of the 35S promoter. This construct was
then used to complement the phenotype of a fadC mutant (Plant Phys., 90:
522-529, 1989) of Arabidopsis thaliana, indicating that the gene encodes a
chloroplastic omega-6 desaturase.

d. Proposed isolation of fad2

The most highly conserved peptide regions in the linoleic
desaturases and the DesA desaturase were chosen as regions likely to be
conserved in oleic desaturases. These 8 conserved regions are shown in
TABLES. These regions were chosen on the following basis: each region
has areas highly conserved between the 3 linoleic desaturases and DesA,
with at least 4 identical amino acids over a 10 amino acid span. Once a
region was identified as conserved, the fad3 linoleic desaturase sequence
was used as the amino acid sequence for the source of homology to identify
oleic desaturases. This is because both fad3 and the non-plastid oleic
desaturases are thought to be localized to the endoplasmic reticulum and
are most likely to contain similar amino acid sequences.

Several peptide endpoints in each conserved area were chosen
as the basis to subsequently design oligonucleotide probes for identifying
the oleic desaturase gene. The peptide targets were chosen to be
between 5 and 9 amino acids in length. The peptide end points were chosen
to end on the conserved (identical) amino acids, and most often to begin on
conserved amino acids. The rationale is that within the larger conserved
area, some amino acid portions are more highly conserved than others, that
15 to 27 nucleotides (5 to 9 amino acids) is a good primer size for PCR, and
that for PCR it is important that the 3' end of the primer matches the
target, with the conserved (identical) amino acids the most likely to be
present in the oleic desaturases. These 28 "oleic desaturase" peptide
targets (Table 6) are the basis for oligonucleotides that are designed for
hybridizing to the oleic desaturase cDNA sequences to identify and isolate
the oleic desaturase cDNA clone.

Several possible methods for designing oligonucleotides and
isolating the genes encoding the target peptide regions are known. For a
discussion of designing degenerate oligonucleotides see PCR Protocols - A
Guide to Methods and Applications, Eds. M. A. Innis, D. H. Gelfand, J. J.
Sninsky and T. J. White, Academic Press, San Diego, California, 1990; and
Sambrook. The two most common screening methods using the
oligonucleotides are screening cDNA libraries and PCR amplification of
specific cDNAs. Gene probes from fad3, fadD and fadE are used under
stringent hybridization conditions to identify these cDNAs and discard
them in the screen for oleic desaturase cDNA clones. The method for using
degenerate oligonucleotides to screen a cDNA library has been described in
the example above demonstrating the isolation of the fadC oleic desaturase
gene. An immature plant seed active in oil biosynthesis, generally 2 to 5
weeks after pollination, preferably about 3 to 4 weeks after pollination, of a
plant such as Arabidopsis or canola is used as the source of mRNA for
making cDNA. First strand cDNA is made from the isolated mRNA and
hybridized under stringent conditions in solution to an excess of biotinylated
fad3, fadD and fadE cloned cDNAs. The hybrids and biotinylated nucleic
acids are removed with streptavidin and a second round of subtraction is
done to remove any remaining fad3, fadD and fadE sequences. The cDNA
remaining in solution is used for PCR reactions. (For 5' RACE, see below, a
polyA tail is added to the first strand cDNA 3' end).

A method that can readily evaluate a number of degenerate
oligonucleotide probes is degenerate PCR (see chapters by Compton and
by Lee and Caskey in PCR Protocols, cited above). In this method a
degenerate set of oligonucleotides encompassing all the possible codon
choices for the target peptide is synthesized (such degenerate






TABLE 7

Peptide Targets for Fad2 Cloning

Peptide   Peptide sequence   Oligo sequence 5'-3'

1a        DIRAAIP            GAYATHMGNGCNGCNATHCC
1b        AIPKHC             GCNATHCCNAARCAYTG
1c        KHCWVK             AARCAYTGYTGGGTNAA
2a        WFLWPLYW           TGGTTYYTNTGGCCNYTNTAYTGG
2b        WFLWP              TGGTTYYTNTGGCCN
2c        WPLYW              TGGCCNYTNTAYTGG
2d        WVAQGT             TGGGTNGCNCARGGNAC
3a        FVLGHD             TTYGTNYTNGGNCAYGA
3b        VLGHDC             GTNYTNGGNCAYGAYTG
3c        GHDCGH             GGNCAYGAYTGYGGNCA
3d        CGHGSF             TGYGGNCAYGGNWSNTT
4a        PYHGW              CCNTAYCAYGGNTGG
4b        HGWRISH            CAYGGNTGGMGNATHWSNCA
4c-1      WRISHRTHH          TGGMGNATHTCNCAYMGNACNCAYCA *
4c-2      WRISHRTHH          TGGMGNATHAGYCAYMGNACNCAYCA *
4d        WRISH              TGGMGNATHWSNCAY
4e        HRTHH              CAYMGNACNCAYCAY
5a        ENDESW             GARAAYGAYGARWSNTGG
5b        DESWVP             GAYGARWSNTGGGTNCC
6a        WLDAVT             NGTNACNGCRTCNARCCA
6b        TYLHH              RTGRTGNARRTANGT
7a-1      WSYLRGGL           ARNCCNCCNCKNARRTARCTCCA *
7a-2      WSYLRGGL           ARNCCNCCNCKNARRTANGACCA *
7b        LTTIDRD            RTCNCKRTCDATNGTNGTNA
7c        TIDRDY             RTARTCNCKRTCDATNGT
8a        HDIGTH             RTGNGTNCCDATRTCRTG
8b        HVIHHL             NARRTGRTGDATNACRTG
8c        HHLFPQI            DATYTGNGGRAANARRTGRTG
8d        HLFPQIP            GGDATYTGNGGRAANARRTG
8e        LFPQIPHY           RTARTGNGGDATYTGNGGRAANA

* synthesize 4c and 7a in two pools each to limit the degeneracy

Oligos for 6a - 8e are the complement of the coding sequence
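
The derivation of a degenerate oligonucleotide from a peptide target can
be sketched in Python as follows. This is an illustrative sketch rather
than the procedure actually used: the names back_translate and
reverse_complement are hypothetical, and the one-code-per-amino-acid
table is an assumption inferred from the oligonucleotides listed above
(for leucine, arginine and serine the single IUPAC code is slightly
over-inclusive, covering a few codons for other amino acids).

```python
# One degenerate codon per amino acid, written with IUPAC codes.
# YTN (Leu), MGN (Arg) and WSN (Ser) are deliberately over-inclusive.
DEGENERATE_CODON = {
    "A": "GCN", "C": "TGY", "D": "GAY", "E": "GAR", "F": "TTY",
    "G": "GGN", "H": "CAY", "I": "ATH", "K": "AAR", "L": "YTN",
    "M": "ATG", "N": "AAY", "P": "CCN", "Q": "CAR", "R": "MGN",
    "S": "WSN", "T": "ACN", "V": "GTN", "W": "TGG", "Y": "TAY",
}

# Complement of each IUPAC code (for example R = A/G pairs with Y = T/C).
COMPLEMENT = {
    "A": "T", "T": "A", "C": "G", "G": "C", "R": "Y", "Y": "R",
    "M": "K", "K": "M", "S": "S", "W": "W", "H": "D", "D": "H",
    "B": "V", "V": "B", "N": "N",
}


def back_translate(peptide, length=None):
    """Return the sense-strand degenerate oligo, optionally truncated at the 3' end."""
    oligo = "".join(DEGENERATE_CODON[aa] for aa in peptide)
    return oligo if length is None else oligo[:length]


def reverse_complement(oligo):
    """Return the antisense oligo, written 5' to 3'."""
    return "".join(COMPLEMENT[code] for code in reversed(oligo))


# Sense primer: drop the last nucleotide of the last codon (17 of 18 nt).
print(back_translate("GHDCGH", 17))                 # GGNCAYGAYTGYGGNCA
# Antisense primers: complement of the coding sequence.
print(reverse_complement(back_translate("TYLHH")))  # RTGRTGNARRTANGT
```

Dropping the final, fully degenerate nucleotide of the last codon
reduces the degeneracy without changing which peptide the primer
targets.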
TABLE 8

Table of Oligomers for PCR RACE of fad2

Peptide #   Oligo Length   Fold Degeneracy   Similarity with L26296   Similarity in Last 10 n.t.

1a          20             384               75 %                     80 %
1b          17             192               88                       80
1c          17             32                66                       80
2a          24             64                79                       100
2b          15             48                73                       80
2c          15             48                100                      100
2d          17             128               76                       90
3a          17             384               76                       70
3b          17             384               82                       80
3c          17             128               88                       90
3d          17             384               82                       70
4a          15             64                80                       70
4b          20             192               76                       90
4c          26             96*               81                       80
4d          15             216               87                       90
4e          16             192               87                       80
5a          18             96                72                       80
5b          17             96                76                       80
6a          18             256               78                       80
6b          15             192               93                       100
7a          23             256*              78                       60
7b          20             384               90                       80
7c          18             192               94                       90
8a          18             384               72                       70
8b          18             192               89                       80
8c          21             384               81                       100
8d          20             192               80                       90
8e          23             192               83                       70

* done in two oligo pools
Table 7 shows the 28 peptide targets from the eight conserved
regions and the 30 degenerate oligonucleotides derived from the peptide
sequences. The degeneracy was kept to less than 516 fold, for those
instances where more degeneracy occurred, by the use of deoxyinosine
(Sambrook et al.) and by not including the last nucleotide in the last codon,
and in two cases by the use of two subpools. Table 8 shows the amount of
degeneracy for each designed oligonucleotide sequence and the amount of
homology of the oligonucleotides to the Arabidopsis oleic desaturase fad2
(Accession No. L26296). Also shown in Table 8 is the percent homology in
the last 10 nucleotides on the 3' end of each primer, since this region is most
important for annealing and elongation under PCR conditions. It is
expected that both 10 of 10 and 9 of 10 homology matches, and probably 8
of 10 homology matches in the 3' primer regions will serve as efficient PCR
primers. Note that for oligonucleotide sets 1a through 5b (for 3' RACE) the
strand direction is the same as the mRNA while for oligonucleotide sets 6a
through 8e (for 5' RACE) the direction is opposite of the mRNA. Four
oligonucleotides have a 10 of 10 match in the 3' position, 6 oligonucleotides
match 9 of 10 in the 3' position and 12 match in 8 of 10 nucleotides in the 3'
position. Oligonucleotides corresponding to peptides 2a, 2c, 2d, 3c, 4b, 4d,
6b, 7c, 8c, and 8d show 90% or greater homology in their last 10
nucleotides and anneal to the oleic desaturase gene and serve as primers to
this gene. This demonstrates the validity of using the conserved regions of
the plant linoleic desaturases and DesA to identify and isolate plant oleic
desaturases.
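
The counts quoted above can be checked against the last column of
Table 8 with a short Python tally. The sketch below is illustrative
only; the dictionary simply transcribes the "Similarity in Last 10
n.t." column, and the variable names are hypothetical.

```python
from collections import Counter

# Percent similarity in the last 10 nucleotides, transcribed from Table 8.
LAST10 = {
    "1a": 80, "1b": 80, "1c": 80, "2a": 100, "2b": 80, "2c": 100, "2d": 90,
    "3a": 70, "3b": 80, "3c": 90, "3d": 70, "4a": 70, "4b": 90, "4c": 80,
    "4d": 90, "4e": 80, "5a": 80, "5b": 80, "6a": 80, "6b": 100, "7a": 60,
    "7b": 80, "7c": 90, "8a": 70, "8b": 80, "8c": 100, "8d": 90, "8e": 70,
}

# How many oligonucleotides fall at each level of 3' similarity.
tally = Counter(LAST10.values())
print(tally[100], tally[90], tally[80])  # 4 6 12

# Oligonucleotides with at least 9 of 10 matching 3' nucleotides.
strong = sorted(name for name, pct in LAST10.items() if pct >= 90)
print(strong)
```

The tally reproduces the four 10-of-10, six 9-of-10 and twelve 8-of-10
matches stated in the text, and the list of oligonucleotides at 90% or
greater is the same ten named above.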

The first round of PCR products are subjected to two rounds of
subtraction using biotinylated fad3, fadD and fadE cloned cDNA to remove
any hybridizing fad3, fadD and fadE sequences with streptavidin. This
subtracted DNA is greatly enriched for fad2 sequences and depleted of fad3,
fadD and fadE sequences. These 30 samples are run on agarose gels,
blotted and hybridized with pools of probe from the 30 samples. Pools of 5
of the 30 PCR samples are labeled with random primers and
hybridized to the blots of the 30 samples, for a total of 6 blots hybridized
with 6 pools of 5 probes. Additionally, a pool of fad3, fadD and fadE probe is
hybridized to a duplicate blot. Bands that do not hybridize strongly to fad3,
fadD and fadE but do cross hybridize to probe made from a different sample
are strong candidates for fad2 as fad2 is likely to be the only DNA amplified
in two or more independent PCR reactions. Positively hybridizing lanes
identify samples to amplify by PCR using the same primers as in the initial
reaction for 5 to 10 cycles and the PCR products are cloned into plasmid
vectors. The same probe that recognized the sample on the blot is used to
screen the library and identify the hybridizing clone. Positive clones are
sequenced and identified as fad2 clones by their homology but non-identity
with fad3, and further characterized as described below.

In the event that fad2 sequences are not sufficiently enriched
in one round of PCR to be identified, a second round of PCR is performed. If
the lack of detection is due to insufficient amplification of fad2, then
another round of PCR using the same primers on the subtracted PCR first
round samples and the same simple screen as described above will identify
fad2. If there are too many competing non-specific reactions then a second
round of PCR using a different primer combination will remove non-specific
amplifications and enrich for fad2. To further enrich for fad2 sequences
each of the initial 30 PCR samples (one for each oligonucleotide in Table 7),
after subtraction as described above, is subjected to a second round of PCR
reactions using a different primer combination than the first reaction. One
of the primers would be the same degenerate oligonucleotide primer as in
the first PCR reaction. The second primer would now be from one of the 30
primers in Table 7 from the opposite class, i.e., primers from 1a to 5b form
matched sets with primers from 6a to 8e (primers 1a to 5b are in the sense
direction while primers 6a to 8e are in the antisense direction). For
example, if oligonucleotide 1a was used initially, it is used again as one of
the two primers and the second primer is each of the 6a to 8e
oligonucleotides for a total of 11 separate PCR reactions. In total the 30
initial reactions result in 418 second cycle PCR reactions, a number easily
handled by PCR technology. Essentially this second PCR cycle
accomplishes a "nested" or sequential PCR reaction step after removing all
the linoleic desaturases by the subtraction step. This increases the
amplification as well as the specificity. Identification of samples containing
fad2 is performed similarly as described above, with the 418 samples dot
blotted onto 22 filters and probed with 21 pools of 20 samples and with a
pool of fad3, fadD and fadE. Again, any sample that cross hybridizes with
an independent probe sample and does not hybridize to fad3, fadD and fadE
is a candidate for containing fad2 in the sample. If fad3, fadD and fadE
hybridization is still present, another biotinylation/streptavidin subtraction
should remove it. Positively hybridizing samples are run on gels, the band
identified by hybridization and isolated for cloning. This second set of PCR
reactions produces PCR products of a predictable size since both primers
are within the coding region where little variation in size is expected. Thus
the presence of a band of the expected size on a gel is diagnostic of fad2,
particularly if hybridization of a blot of such a gel with a fad3, fadD and
fadE probe indicates the band is not due to fad3, fadD and fadE
contamination. After cloning the inserts in E. coli, the resulting plasmids
containing the insert are identified by hybridization. They are sequenced
and identified as oleic desaturases by their homology but non-identity with
the linoleic desaturases, as in the examples described previously. The full
length clone of these cDNAs is obtained by standard methods and inserted
into plant gene expression and transformation vectors and transformed
into Arabidopsis fad2 mutants to confirm the identity of the oleic
desaturase by genetic complementation as was described in the example with
linoleic desaturase.
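
The count of second cycle reactions follows from pairing each of the 30
first-round primers with every primer of the opposite class. The Python
sketch below is illustrative only; the function name
second_round_reactions is hypothetical, and the primer names are those
of Table 7.

```python
import math

# Sense-direction primers (1a to 5b) and antisense primers (6a to 8e), per Table 7.
SENSE = ["1a", "1b", "1c", "2a", "2b", "2c", "2d", "3a", "3b", "3c", "3d",
         "4a", "4b", "4c-1", "4c-2", "4d", "4e", "5a", "5b"]
ANTISENSE = ["6a", "6b", "7a-1", "7a-2", "7b", "7c",
             "8a", "8b", "8c", "8d", "8e"]


def second_round_reactions():
    """Pair each first-round primer with every primer of the opposite class.

    Each tuple is (primer retained from the first round, new partner primer).
    """
    reactions = [(first, partner) for first in SENSE for partner in ANTISENSE]
    reactions += [(first, partner) for first in ANTISENSE for partner in SENSE]
    return reactions


reactions = second_round_reactions()
print(len(SENSE) + len(ANTISENSE))    # 30 first-round samples
print(len(reactions))                 # 418 = 19 * 11 + 11 * 19
print(math.ceil(len(reactions) / 20)) # 21 probe pools of up to 20 samples
```

A sample first amplified with 1a thus yields 11 second cycle reactions,
and the 418 products fall into 21 probe pools of up to 20 samples, in
agreement with the figures given above.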

Thus in this approach to isolating the plant oleic desaturases,
the total number of peptide regions is 8, comprising 28 smaller peptide
targets. This leads to a set of 30 degenerate oligonucleotides that are used in
the PCR amplification and screening of the PCR products. Subtraction of
interfering fad3, fadD and fadE sequences is used at several points. If
necessary a second round of PCR reactions with paired internal primers
gives extra amplification and specificity. This approach identifies the plant
oleic desaturases, and the sequence of the isolated clones should confirm
their identity by their homology to the plant linoleic desaturases as
described. Thus a defined approach to isolating the plant oleic desaturases
from the information about linoleic desaturases is presented here. The
example given here is for Arabidopsis or canola oleic desaturases, but the
approach is not limited to those plants as the oleic desaturases are
probably highly conserved in most plants. Thus once one plant oleic
desaturase is isolated, the sequence information is used to isolate the genes
from other plant species by direct hybridization or by an approach similar
to the one described here.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT:
(A) NAME: Monsanto Company
(B) STREET: 800 North Lindbergh Boulevard
(C) CITY: St. Louis
(D) STATE: Missouri
(E) COUNTRY: United States of America
(F) POSTAL CODE (ZIP): 63167
(G) TELEPHONE: (314)694-3131
(H) TELEFAX: (314)694-5435

(ii) TITLE OF INVENTION: Altered Linolenic and Linoleic Acid Content
in Plants

(iii) NUMBER OF SEQUENCES: 72

(iv) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)

(vi) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 08/156551
(B) FILING DATE: 22-NOV-1993

(vi) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 08/014431
(B) FILING DATE: 05-FEB-1993

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1353 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 87..1238

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
AATCCATCAA ACCTTTATTC ACCACATTTC ACTGAAAGGC CACACATCTA GAGAGAGAAA 60

CTTCGTCCAA ATCTCTCTCT CCAGCG ATG GTT GTT GCT ATG GAC CAG CGC AGC 113
                             Met Val Val Ala Met Asp Gln Arg Ser
                             1               5

AAT GTT AAC GGA GAT TCC GGT GCC CGG AAG GAA GAA GGG TTT GAT CCA 161
Asn Val Asn Gly Asp Ser Gly Ala Arg Lys Glu Glu Gly Phe Asp Pro
10                  15                  20                  25

AGC GCA CAA CCA CCG TTT AAG ATC GGA GAT ATA AGG GCG GCG ATT CCT 209
Ser Ala Gln Pro Pro Phe Lys Ile Gly Asp Ile Arg Ala Ala Ile Pro
                30                  35                  40

AAG CAT TGC TGG GTG AAG AGT CCT TTG AGA TCT ATG AGC TAC GTC ACC 257
Lys His Cys Trp Val Lys Ser Pro Leu Arg Ser Met Ser Tyr Val Thr
            45                  50                  55

AGA GAC ATT TTC GCC GTC GCG GCT CTG GCC ATG GCC GCC GTG TAT TTT 305
Arg Asp Ile Phe Ala Val Ala Ala Leu Ala Met Ala Ala Val Tyr Phe
        60                  65                  70

GAT AGC TGG TTC CTC TGG CCA CTC TAC TGG GTT GCC CAA GGA ACC CTT 353
Asp Ser Trp Phe Leu Trp Pro Leu Tyr Trp Val Ala Gln Gly Thr Leu
    75                  80                  85

TTC TGG GCC ATC TTC GTT CTT GGC CAC GAC TGT GGA CAT GGG AGT TTC 401
Phe Trp Ala Ile Phe Val Leu Gly His Asp Cys Gly His Gly Ser Phe
90                  95                  100                 105

TCA GAC ATT CCT CTG CTG AAC AGT GTG GTT GGT CAC ATT CTT CAT TCA 449
Ser Asp Ile Pro Leu Leu Asn Ser Val Val Gly His Ile Leu His Ser
                110                 115                 120

TTC ATC CTC GTT CCT TAC CAT GGT TGG AGA ATA AGC CAT CGG ACA CAC 497
Phe Ile Leu Val Pro Tyr His Gly Trp Arg Ile Ser His Arg Thr His
            125                 130                 135

CAC CAG AAC CAT GGC CAT GTT GAA AAC GAC GAG TCT TGG GTT CCG TTG 545
His Gln Asn His Gly His Val Glu Asn Asp Glu Ser Trp Val Pro Leu
        140                 145                 150

CCA GAA AAG TTG TAC AAG AAC TTG CCC CAT AGT ACT CGG ATG CTC AGA 593
Pro Glu Lys Leu Tyr Lys Asn Leu Pro His Ser Thr Arg Met Leu Arg
    155                 160                 165

TAC ACT GTC CCT CTC CCC ATG CTC GCT TAC CCG ATC TAT CTG TGG TAC 641
Tyr Thr Val Pro Leu Pro Met Leu Ala Tyr Pro Ile Tyr Leu Trp Tyr
170                 175                 180                 185

AGA AGT CCT GGA AAA GAA GGG TCA CAT TTT AAC CCA TAC AGT AGT TTA 689
Arg Ser Pro Gly Lys Glu Gly Ser His Phe Asn Pro Tyr Ser Ser Leu
                190                 195                 200

TTT GCT CCA AGC GAG AGG AAG CTT ATT GCA ACT TCA ACT ACT TGC TGG 737
Phe Ala Pro Ser Glu Arg Lys Leu Ile Ala Thr Ser Thr Thr Cys Trp
            205                 210                 215

TCC ATA ATG TTG GCC ACT CTT GTT TAT CTA TCG TTC CTC GTT GAT CCA 785
Ser Ile Met Leu Ala Thr Leu Val Tyr Leu Ser Phe Leu Val Asp Pro
        220                 225                 230

GTC ACA GTT CTC AAA GTC TAT GGC GTT CCT TAC ATT ATC TTT GTG ATG 833
Val Thr Val Leu Lys Val Tyr Gly Val Pro Tyr Ile Ile Phe Val Met
    235                 240                 245

TGG TTG GAC GCT GTC ACG TAC TTG CAT CAT CAT GGT CAC GAT GAG AAG 881
Trp Leu Asp Ala Val Thr Tyr Leu His His His Gly His Asp Glu Lys
250                 255                 260                 265

TTG CCT TGG TAC AGA GGC AAG GAA TGG AGT TAT TTA CGT GGA GGA TTA 929
Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu
                270                 275                 280

ACA ACT ATT GAT AGA GAT TAC GGA ATC TTC AAC AAC ATC CAT CAC GAC 977
Thr Thr Ile Asp Arg Asp Tyr Gly Ile Phe Asn Asn Ile His His Asp
            285                 290                 295

ATT GGA ACT CAC GTG ATC CAT CAT CTT TTC CCA CAA ATC CCT CAC TAT 1025
Ile Gly Thr His Val Ile His His Leu Phe Pro Gln Ile Pro His Tyr
        300                 305                 310

CAC TTG GTC GAT GCC ACG AGA GCA GCT AAA CAT GTG TTA GGA AGA TAC 1073
His Leu Val Asp Ala Thr Arg Ala Ala Lys His Val Leu Gly Arg Tyr
    315                 320                 325

TAC AGA GAG CCG AAG ACG TCA GGA GCA ATA CCG ATT CAC TTG GTG GAG 1121
Tyr Arg Glu Pro Lys Thr Ser Gly Ala Ile Pro Ile His Leu Val Glu
330                 335                 340                 345

AGT TTG GTC GCA AGT ATT AAA AAA GAT CAT TAC GTC AGT GAC ACT GGT 1169
Ser Leu Val Ala Ser Ile Lys Lys Asp His Tyr Val Ser Asp Thr Gly
                350                 355                 360

GAT ATT GTC TTC TAC GAG ACA GAT CCA GAT CTC TAC GTT TAT GCT TCT 1217
Asp Ile Val Phe Tyr Glu Thr Asp Pro Asp Leu Tyr Val Tyr Ala Ser
            365                 370                 375

GAC AAA TCT AAA ATC AAT TAACTTTTCT TCCTAGCTCT ATTAGGAATA 1265
Asp Lys Ser Lys Ile Asn
        380

AACACTCCTT CTCTTTTACT TATTTGTTTC TGCTTTAAGT TTAAAATGTA CTCGTGAAAC 1325
CTTTTTTTTA TTAATGTATT TACGTTAC 1353
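
As an aid to checking the listing, the relation between the CDS
location and the protein length, and the translation of the first
codons, can be reproduced with the following Python sketch. It is
illustrative only: the function name translate is hypothetical, and the
standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# CDS location 87..1238 of SEQ ID NO:1, inclusive of the stop codon.
cds_start, cds_end = 87, 1238
codons = (cds_end - cds_start + 1) // 3
print(codons - 1)  # 383 amino acids, matching the length of SEQ ID NO:2

# First nine codons of the CDS as listed above.
print(translate("ATGGTTGTTGCTATGGACCAGCGCAGC"))  # MVVAMDQRS
```

The 1152 nucleotides of the CDS give 384 codons, the last being the TAA
stop codon, which is consistent with the 383 amino acids of SEQ ID NO:2.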



(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 383 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Met Val Val Ala Met Asp Gln Arg Ser Asn Val Asn Gly Asp Ser Gly
1               5                   10                  15

Ala Arg Lys Glu Glu Gly Phe Asp Pro Ser Ala Gln Pro Pro Phe Lys
            20                  25                  30

Ile Gly Asp Ile Arg Ala Ala Ile Pro Lys His Cys Trp Val Lys Ser
        35                  40                  45

Pro Leu Arg Ser Met Ser Tyr Val Thr Arg Asp Ile Phe Ala Val Ala
    50                  55                  60

Ala Leu Ala Met Ala Ala Val Tyr Phe Asp Ser Trp Phe Leu Trp Pro
65                  70                  75                  80

Leu Tyr Trp Val Ala Gln Gly Thr Leu Phe Trp Ala Ile Phe Val Leu
                85                  90                  95

Gly His Asp Cys Gly His Gly Ser Phe Ser Asp Ile Pro Leu Leu Asn
            100                 105                 110

Ser Val Val Gly His Ile Leu His Ser Phe Ile Leu Val Pro Tyr His
        115                 120                 125

Gly Trp Arg Ile Ser His Arg Thr His His Gln Asn His Gly His Val
    130                 135                 140

Glu Asn Asp Glu Ser Trp Val Pro Leu Pro Glu Lys Leu Tyr Lys Asn
145                 150                 155                 160

Leu Pro His Ser Thr Arg Met Leu Arg Tyr Thr Val Pro Leu Pro Met
                165                 170                 175

Leu Ala Tyr Pro Ile Tyr Leu Trp Tyr Arg Ser Pro Gly Lys Glu Gly
            180                 185                 190

Ser His Phe Asn Pro Tyr Ser Ser Leu Phe Ala Pro Ser Glu Arg Lys
        195                 200                 205

Leu Ile Ala Thr Ser Thr Thr Cys Trp Ser Ile Met Leu Ala Thr Leu
    210                 215                 220

Val Tyr Leu Ser Phe Leu Val Asp Pro Val Thr Val Leu Lys Val Tyr
225                 230                 235                 240

Gly Val Pro Tyr Ile Ile Phe Val Met Trp Leu Asp Ala Val Thr Tyr
                245                 250                 255

Leu His His His Gly His Asp Glu Lys Leu Pro Trp Tyr Arg Gly Lys
            260                 265                 270

Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr Ile Asp Arg Asp Tyr
        275                 280                 285

Gly Ile Phe Asn Asn Ile His His Asp Ile Gly Thr His Val Ile His
    290                 295                 300

His Leu Phe Pro Gln Ile Pro His Tyr His Leu Val Asp Ala Thr Arg
305                 310                 315                 320

Ala Ala Lys His Val Leu Gly Arg Tyr Tyr Arg Glu Pro Lys Thr Ser
                325                 330                 335

Gly Ala Ile Pro Ile His Leu Val Glu Ser Leu Val Ala Ser Ile Lys
            340                 345                 350

Lys Asp His Tyr Val Ser Asp Thr Gly Asp Ile Val Phe Tyr Glu Thr
        355                 360                 365

Asp Pro Asp Leu Tyr Val Tyr Ala Ser Asp Lys Ser Lys Ile Asn
    370                 375                 380



(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
GGCGATGCTG TCGGAATGGA CGATA
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CTTGGAGCCA CTATCGACTA CGCGATC 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
CCGATCTCAA GATTATCGAA T
(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
TTCCTAATGC AGGAGTCGCA TAAG
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
AGGAGTCGCA TAAGGGAG
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGGAAGTGAA TGGAGAC 17
(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1645 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 125..1465



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GGAAAACACA AGTTTCTCTC ACACACATTA TCTCTTTCTC TATTACCACC ACTCATTCAT 60

AACAGAAACC CACCAAAAAA TAAAAAGAGA GACTTTTCAC TCTGGGGAGA GAGCTCAAGT 120

TCTA ATG GCG AAC TTG GTC TTA TCA GAA TGT GGT ATA CGA CCT CTC CCC 169
Met Ala Asn Leu Val Leu Ser Glu Cys Gly Ile Arg Pro Leu Pro
1 5 10 15

AGA ATC TAC ACA ACA CCC AGA TCC AAT TTC CTC TCC AAC AAC AAC AAA 217
Arg Ile Tyr Thr Thr Pro Arg Ser Asn Phe Leu Ser Asn Asn Asn Lys
20 25 30

TTC AGA CCA TCA CTT TCT TCT TCT TCT TAC AAA ACA TCA TCA TCT CCT 265
Phe Arg Pro Ser Leu Ser Ser Ser Ser Tyr Lys Thr Ser Ser Ser Pro
35 40 45

CTG TCT TTT GGT CTG AAT TCA CGA GAT GGG TTC ACG AGG AAT TGG GCG 313
Leu Ser Phe Gly Leu Asn Ser Arg Asp Gly Phe Thr Arg Asn Trp Ala
50 55 60

TTG AAT GTG AGC ACA CCA TTA ACG ACA CCA ATA TTT GAG GAG TCT CCA 361
Leu Asn Val Ser Thr Pro Leu Thr Thr Pro Ile Phe Glu Glu Ser Pro
65 70 75

TTG GAG GAA GAT AAT AAA CAG AGA TTC GAT CCA GGT GCG CCT CCT CCG 409
Leu Glu Glu Asp Asn Lys Gln Arg Phe Asp Pro Gly Ala Pro Pro Pro
80 85 90 95

TTC AAT TTA GCT GAT ATT AGA GCA GCT ATA CCT AAG CAT TGT TGG GTT 457
Phe Asn Leu Ala Asp Ile Arg Ala Ala Ile Pro Lys His Cys Trp Val
100 105 110

AAG AAT CCA TGG AAG TCT TTG AGT TAT GTC GTC AGA GAC GTC GCT ATC 505
Lys Asn Pro Trp Lys Ser Leu Ser Tyr Val Val Arg Asp Val Ala Ile
115 120 125
GTC TTT GCA TTG GCT GCT GGA GCT GCT TAC CTC AAC AAT TGG ATT GTT 553
Val Phe Ala Leu Ala Ala Gly Ala Ala Tyr Leu Asn Asn Trp Ile Val
130 135 140

TGG CCT CTC TAT TGG CTC GCT CAA GGA ACC ATG TTT TGG GCT CTC TTT 601
Trp Pro Leu Tyr Trp Leu Ala Gln Gly Thr Met Phe Trp Ala Leu Phe
145 150 155

GTT CTT GGT CAT GAC TGT GGA CAT GGT AGT TTC TCA AAT GAT CCG AAG 649
Val Leu Gly His Asp Cys Gly His Gly Ser Phe Ser Asn Asp Pro Lys
160 165 170 175

TTG AAC AGT GTG GTC GGT CAT CTT CTT CAT TCC TCA ATT CTG GTC CCA 697
Leu Asn Ser Val Val Gly His Leu Leu His Ser Ser Ile Leu Val Pro
180 185 190

TAC CAT GGC TGG AGA ATT AGT CAC AGA ACT CAC CAC CAG AAC CAT GGA 745
Tyr His Gly Trp Arg Ile Ser His Arg Thr His His Gln Asn His Gly
195 200 205

CAT GTT GAG AAT GAC GAA TCT TGG CAT CCT ATG TCT GAG AAA ATC TAC 793
His Val Glu Asn Asp Glu Ser Trp His Pro Met Ser Glu Lys Ile Tyr
210 215 220

AAT ACT TTG GAC AAG CCG ACT AGA TTC TTT AGA TTT ACA CTG CCT CTC 841
Asn Thr Leu Asp Lys Pro Thr Arg Phe Phe Arg Phe Thr Leu Pro Leu
225 230 235

GTG ATG CTT GCA TAC CCT TTC TAC TTG TGG GCT CGA AGT CCG GGG AAA 889
Val Met Leu Ala Tyr Pro Phe Tyr Leu Trp Ala Arg Ser Pro Gly Lys
240 245 250 255

AAG GGT TCT CAT TAC CAT CCA GAC AGT GAC TTG TTC CTC CCT AAA GAG 937
Lys Gly Ser His Tyr His Pro Asp Ser Asp Leu Phe Leu Pro Lys Glu
260 265 270

AGA AAG GAT GTC CTC ACT TCT ACT GCT TGT TGG ACT GCA ATG GCT GCT 985
Arg Lys Asp Val Leu Thr Ser Thr Ala Cys Trp Thr Ala Met Ala Ala
275 280 285

CTG CTT GTT TGT CTC AAC TTC ACA ATC GGT CCA ATT CAA ATG CTC AAA 1033
Leu Leu Val Cys Leu Asn Phe Thr Ile Gly Pro Ile Gln Met Leu Lys
290 295 300

CTT TAT GGA ATT CCT TAC TGG ATA AAT GTA ATG TGG TTG GAC TTT GTG 1081
Leu Tyr Gly Ile Pro Tyr Trp Ile Asn Val Met Trp Leu Asp Phe Val
305 310 315

ACT TAC CTG CAT CAC CAT GGT CAT GAA GAT AAG CTT CCT TGG TAC CGT 1129
Thr Tyr Leu His His His Gly His Glu Asp Lys Leu Pro Trp Tyr Arg
320 325 330 335

GGC AAG GAG TGG AGT TAC CTG AGA GGA GGA CTT ACA ACA TTG GAT CGT 1177
Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr Leu Asp Arg
340 345 350
GAC TAC GGA TTG ATC AAT AAC ATC CAT CAT GAT ATT GGA ACT CAT GTG 1225
Asp Tyr Gly Leu Ile Asn Asn Ile His His Asp Ile Gly Thr His Val
355 360 365

ATA CAT CAT CTT TTC CCG CAG ATC CCA CAT TAT CAT CTA GTA GAA GCA 1273
Ile His His Leu Phe Pro Gln Ile Pro His Tyr His Leu Val Glu Ala
370 375 380

ACA GAA GCA GCT AAA CCA GTA TTA GGG AAG TAT TAC AGG GAG CCT GAT 1321
Thr Glu Ala Ala Lys Pro Val Leu Gly Lys Tyr Tyr Arg Glu Pro Asp
385 390 395

AAG TCT GGA CCG TTG CCA TTA CAT TTA CTG GAA ATT CTA GCG AAA AGT 1369
Lys Ser Gly Pro Leu Pro Leu His Leu Leu Glu Ile Leu Ala Lys Ser
400 405 410 415

ATA AAA GAA GAT CAT TAC GTG AGC GAC GAA GGA GAA GTT GTA TAC TAT 1417
Ile Lys Glu Asp His Tyr Val Ser Asp Glu Gly Glu Val Val Tyr Tyr
420 425 430

AAA GCA GAT CCA AAT CTC TAT GGA GAG GTC AAA GTA AGA GCA GAT TGAAATGAAG 1472
Lys Ala Asp Pro Asn Leu Tyr Gly Glu Val Lys Val Arg Ala Asp
435 440 445

CAGGCTTGAG ATTGAAGTTT TTTCTATTTC AGACCAGCTG ATTTTTTGCT TACTGTATCA 1532

ATTTATTGTG TCACCCACCA GAGAGTTAGT ATCTCTGAAT ACGATOGATC AGATGGAAAC 1592

AACAAATTTG TTTGOGATAC TGAAGCTATA TATACCATAA AAAAAAAAAA AAA 1645
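
The two-line layout of SEQ ID NO: 9 (codons above, three-letter residues below) can be cross-checked mechanically. The following is a minimal sketch, assuming the standard genetic code; the function names `translate` and `encoded_residues` are illustrative and do not appear in the listing. It translates the first ten codons of the coding region and checks that a CDS located at 125..1465 encodes 446 residues plus a stop codon, which agrees with the stated length of SEQ ID NO: 10.

```python
# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter residue names, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(cds):
    """Translate a coding sequence (spaces allowed) up to the first stop codon."""
    cds = cds.replace(" ", "").upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(THREE_LETTER[aa])
    return residues


def encoded_residues(start, end):
    """Residues encoded by a CDS at start..end (1-based, inclusive, stop codon included)."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3 - 1


print(" ".join(translate("ATG GCG AAC TTG GTC TTA TCA GAA TGT GGT")))
print(encoded_residues(125, 1465))
```

The same check applied to the CDS location 61..1368 given for SEQ ID NO: 11 yields 435 residues, matching SEQ ID NO: 12.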



(2) INFORMATION FOR SEQ ID NO:10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Ala Asn Leu Val Leu Ser Glu Cys Gly Ile Arg Pro Leu Pro Arg
1 5 10 15

Ile Tyr Thr Thr Pro Arg Ser Asn Phe Leu Ser Asn Asn Asn Lys Phe
20 25 30

Arg Pro Ser Leu Ser Ser Ser Ser Tyr Lys Thr Ser Ser Ser Pro Leu
35 40 45

Ser Phe Gly Leu Asn Ser Arg Asp Gly Phe Thr Arg Asn Trp Ala Leu
50 55 60

Asn Val Ser Thr Pro Leu Thr Thr Pro Ile Phe Glu Glu Ser Pro Leu
65 70 75 80

Glu Glu Asp Asn Lys Gln Arg Phe Asp Pro Gly Ala Pro Pro Pro Phe
85 90 95

Asn Leu Ala Asp Ile Arg Ala Ala Ile Pro Lys His Cys Trp Val Lys
100 105 110

Asn Pro Trp Lys Ser Leu Ser Tyr Val Val Arg Asp Val Ala Ile Val
115 120 125

Phe Ala Leu Ala Ala Gly Ala Ala Tyr Leu Asn Asn Trp Ile Val Trp
130 135 140

Pro Leu Tyr Trp Leu Ala Gln Gly Thr Met Phe Trp Ala Leu Phe Val
145 150 155 160

Leu Gly His Asp Cys Gly His Gly Ser Phe Ser Asn Asp Pro Lys Leu
165 170 175

Asn Ser Val Val Gly His Leu Leu His Ser Ser Ile Leu Val Pro Tyr
180 185 190

His Gly Trp Arg Ile Ser His Arg Thr His His Gln Asn His Gly His
195 200 205

Val Glu Asn Asp Glu Ser Trp His Pro Met Ser Glu Lys Ile Tyr Asn
210 215 220

Thr Leu Asp Lys Pro Thr Arg Phe Phe Arg Phe Thr Leu Pro Leu Val
225 230 235 240

Met Leu Ala Tyr Pro Phe Tyr Leu Trp Ala Arg Ser Pro Gly Lys Lys
245 250 255

Gly Ser His Tyr His Pro Asp Ser Asp Leu Phe Leu Pro Lys Glu Arg
260 265 270

Lys Asp Val Leu Thr Ser Thr Ala Cys Trp Thr Ala Met Ala Ala Leu
275 280 285

Leu Val Cys Leu Asn Phe Thr Ile Gly Pro Ile Gln Met Leu Lys Leu
290 295 300

Tyr Gly Ile Pro Tyr Trp Ile Asn Val Met Trp Leu Asp Phe Val Thr
305 310 315 320

Tyr Leu His His His Gly His Glu Asp Lys Leu Pro Trp Tyr Arg Gly
325 330 335

Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr Leu Asp Arg Asp
340 345 350

Tyr Gly Leu Ile Asn Asn Ile His His Asp Ile Gly Thr His Val Ile
355 360 365
His His Leu Phe Pro Gln Ile Pro His Tyr His Leu Val Glu Ala Thr
370 375 380

Glu Ala Ala Lys Pro Val Leu Gly Lys Tyr Tyr Arg Glu Pro Asp Lys
385 390 395 400

Ser Gly Pro Leu Pro Leu His Leu Leu Glu Ile Leu Ala Lys Ser Ile
405 410 415

Lys Glu Asp His Tyr Val Ser Asp Glu Gly Glu Val Val Tyr Tyr Lys
420 425 430

Ala Asp Pro Asn Leu Tyr Gly Glu Val Lys Val Arg Ala Asp
435 440 445



(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1525 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 61..1368

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AGAGAGTGCA AATAGAACGA CAGAGACTTT TTCCTCTTTT CTTCTTGGGA AGAGGCTCCA 60 

ATG GCG AGC TCG GTT TTA TCA GAA TGT GGT TTT AGA CCT CTC CCC AGA 108
Met Ala Ser Ser Val Leu Ser Glu Cys Gly Phe Arg Pro Leu Pro Arg
1 5 10 15

TTC TAC CCT AAA CAC ACA ACC TCT TTT GCC TCT AAC CCT AAA CCC ACT 156
Phe Tyr Pro Lys His Thr Thr Ser Phe Ala Ser Asn Pro Lys Pro Thr
20 25 30

TTC AAA TTC AAT CCA CCA CTT AAA CCT CCT TCT TCT CTT CTC AAT TCC 204
Phe Lys Phe Asn Pro Pro Leu Lys Pro Pro Ser Ser Leu Leu Asn Ser
35 40 45

CGA TAT GGA TTC TAC TCT AAA ACC AGG AAC TGG GCA TTG AAT GTG GCA 252
Arg Tyr Gly Phe Tyr Ser Lys Thr Arg Asn Trp Ala Leu Asn Val Ala
50 55 60

ACA CCT TTA ACA ACT CTT CAG TCT CCA TCC GAG GAA GAC ACG GAG AGA 300
Thr Pro Leu Thr Thr Leu Gln Ser Pro Ser Glu Glu Asp Thr Glu Arg
65 70 75 80
TTC GAC CCA GGT GCG CCT CCT CCC TTC AAT TTG GCG GAT ATA AGA GCA 348
Phe Asp Pro Gly Ala Pro Pro Pro Phe Asn Leu Ala Asp Ile Arg Ala
85 90 95

GCC ATA CCT AAG CAT TGT TGG GTT AAG AAT CCA TGG ATG TCT ATG AGT 396
Ala Ile Pro Lys His Cys Trp Val Lys Asn Pro Trp Met Ser Met Ser
100 105 110

TAT GTT GTC AGA GAT GTT GCT ATC GTC TTT GGA TTG GCT GCT GTT GCT 444
Tyr Val Val Arg Asp Val Ala Ile Val Phe Gly Leu Ala Ala Val Ala
115 120 125

GCT TAC TTC AAC AAT TGG CTT CTC TGG CCT CTC TAC TGG TTC GCT CAA 492
Ala Tyr Phe Asn Asn Trp Leu Leu Trp Pro Leu Tyr Trp Phe Ala Gln
130 135 140

GGA ACC ATG TTC TGG GCT CTC TTT GTC CTT GGC CAT GAC TGC GGA CAT 540
Gly Thr Met Phe Trp Ala Leu Phe Val Leu Gly His Asp Cys Gly His
145 150 155 160

GGT AGC TTC TCG AAT GAT CCG AGG CTG AAC AGT GTG GCT GGT CAT CTT 588
Gly Ser Phe Ser Asn Asp Pro Arg Leu Asn Ser Val Ala Gly His Leu
165 170 175

CTT CAT TCC TCA ATT CTG GTC CCT TAC CAT GGC TGG AGG ATT AGC CAC 636
Leu His Ser Ser Ile Leu Val Pro Tyr His Gly Trp Arg Ile Ser His
180 185 190

AGA ACT CAC CAC CAG AAC CAT GGT CAT GTC GAG AAT GAC GAA TCA TGG 684
Arg Thr His His Gln Asn His Gly His Val Glu Asn Asp Glu Ser Trp
195 200 205

CAT CCT TTG CCT GAA AGC ATC TAC AAG AAT TTG GAA AAG ACG ACT CAA 732
His Pro Leu Pro Glu Ser Ile Tyr Lys Asn Leu Glu Lys Thr Thr Gln
210 215 220

ATG TTT AGG TTT ACA CTG CCT TTT CCA ATG CTC GCA TAC CCT TTC TAC 780
Met Phe Arg Phe Thr Leu Pro Phe Pro Met Leu Ala Tyr Pro Phe Tyr
225 230 235 240

TTG TGG AAC AGA AGT CCA GGG AAA CAA GGT TCT CAT TAT CAT CCG GAC 828
Leu Trp Asn Arg Ser Pro Gly Lys Gln Gly Ser His Tyr His Pro Asp
245 250 255

AGT GAC TTG TTT CTT CCA AAA GAG AAG AAA GAT GTT CTG ACA TCA ACT 876
Ser Asp Leu Phe Leu Pro Lys Glu Lys Lys Asp Val Leu Thr Ser Thr
260 265 270

GCC TGT TGG ACT GCA ATG GCT GCT TTG CTT GTT TGT CTC AAC TTT GTC 924
Ala Cys Trp Thr Ala Met Ala Ala Leu Leu Val Cys Leu Asn Phe Val
275 280 285

ATG GGT CCA ATC CAG ATG CTC AAA CTA TAT GGC ATC CCT TAT TGG ATA 972
Met Gly Pro Ile Gln Met Leu Lys Leu Tyr Gly Ile Pro Tyr Trp Ile
290 295 300
TTT GTA ATG TGG TTG GAC TTC GTC ACT TAC TTG CAC CAC CAT GGA CAT 1020
Phe Val Met Trp Leu Asp Phe Val Thr Tyr Leu His His His Gly His
305 310 315 320

GAA GAC AAG CTC CCT TGG TAT CGT GGA AAG GAA TGG AGT TAC CTG AGA 1068
Glu Asp Lys Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg
325 330 335

GGA GGG CTC ACA ACA TTA GAT CGT GAC TAC GGA TGG ATC AAT AAC ATC 1116
Gly Gly Leu Thr Thr Leu Asp Arg Asp Tyr Gly Trp Ile Asn Asn Ile
340 345 350

CAC CAC GAT ATT GGA ACT CAT GTG ATA CAT CAT CTT TTC CCG CAG ATC 1164
His His Asp Ile Gly Thr His Val Ile His His Leu Phe Pro Gln Ile
355 360 365

CCA CAT TAT CAT CTA GTA GAA GCA ACA GAA GCA GCT AAA CCA GTA CTA 1212
Pro His Tyr His Leu Val Glu Ala Thr Glu Ala Ala Lys Pro Val Leu
370 375 380

GGA AAG TAC TAC AGA GAA CCG AAA AAC TCT GGA CCT CTG CCA CTT CAC 1260
Gly Lys Tyr Tyr Arg Glu Pro Lys Asn Ser Gly Pro Leu Pro Leu His
385 390 395 400

TTA CTG GGA AGC CTC ATA AAG AGT ATG AAA CAA GAC CAT TTC GTA AGC 1308
Leu Leu Gly Ser Leu Ile Lys Ser Met Lys Gln Asp His Phe Val Ser
405 410 415

GAT ACA GGA GAT GTC GTG TAC TAT GAG GCA GAT CCA AAA CTC AAT GGA 1356
Asp Thr Gly Asp Val Val Tyr Tyr Glu Ala Asp Pro Lys Leu Asn Gly
420 425 430

CAA AGA ACA TGAGGACATA CTGCAGTGAA CCAGGCAGAC AAGTTACATA 1405
Gln Arg Thr
435

AATTCATCTT GGCCCATTGA TTATGTTCTT TTTGTTTTGG TGTAAAGCCT TTTCGAGATT 1465
AAAAAAGCAT TAATTTGTAG AAACCTGTGG TAAAACTCTC GATCAAATGA AATAAGATAT 1525



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 435 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ala Ser Ser Val Leu Ser Glu Cys Gly Phe Arg Pro Leu Pro Arg 
1 5 10 15

Phe Tyr Pro Lys His Thr Thr Ser Phe Ala Ser Asn Pro Lys Pro Thr
20 25 30

Phe Lys Phe Asn Pro Pro Leu Lys Pro Pro Ser Ser Leu Leu Asn Ser
35 40 45

Arg Tyr Gly Phe Tyr Ser Lys Thr Arg Asn Trp Ala Leu Asn Val Ala
50 55 60

Thr Pro Leu Thr Thr Leu Gln Ser Pro Ser Glu Glu Asp Thr Glu Arg
65 70 75 80

Phe Asp Pro Gly Ala Pro Pro Pro Phe Asn Leu Ala Asp Ile Arg Ala
85 90 95

Ala Ile Pro Lys His Cys Trp Val Lys Asn Pro Trp Met Ser Met Ser
100 105 110

Tyr Val Val Arg Asp Val Ala Ile Val Phe Gly Leu Ala Ala Val Ala
115 120 125

Ala Tyr Phe Asn Asn Trp Leu Leu Trp Pro Leu Tyr Trp Phe Ala Gln
130 135 140

Gly Thr Met Phe Trp Ala Leu Phe Val Leu Gly His Asp Cys Gly His
145 150 155 160

Gly Ser Phe Ser Asn Asp Pro Arg Leu Asn Ser Val Ala Gly His Leu
165 170 175

Leu His Ser Ser Ile Leu Val Pro Tyr His Gly Trp Arg Ile Ser His
180 185 190

Arg Thr His His Gln Asn His Gly His Val Glu Asn Asp Glu Ser Trp
195 200 205

His Pro Leu Pro Glu Ser Ile Tyr Lys Asn Leu Glu Lys Thr Thr Gln
210 215 220

Met Phe Arg Phe Thr Leu Pro Phe Pro Met Leu Ala Tyr Pro Phe Tyr
225 230 235 240

Leu Trp Asn Arg Ser Pro Gly Lys Gln Gly Ser His Tyr His Pro Asp
245 250 255

Ser Asp Leu Phe Leu Pro Lys Glu Lys Lys Asp Val Leu Thr Ser Thr
260 265 270

Ala Cys Trp Thr Ala Met Ala Ala Leu Leu Val Cys Leu Asn Phe Val
275 280 285

Met Gly Pro Ile Gln Met Leu Lys Leu Tyr Gly Ile Pro Tyr Trp Ile
290 295 300

Phe Val Met Trp Leu Asp Phe Val Thr Tyr Leu His His His Gly His
305 310 315 320
Glu Asp Lys Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg
325 330 335

Gly Gly Leu Thr Thr Leu Asp Arg Asp Tyr Gly Trp Ile Asn Asn Ile
340 345 350

His His Asp Ile Gly Thr His Val Ile His His Leu Phe Pro Gln Ile
355 360 365

Pro His Tyr His Leu Val Glu Ala Thr Glu Ala Ala Lys Pro Val Leu
370 375 380

Gly Lys Tyr Tyr Arg Glu Pro Lys Asn Ser Gly Pro Leu Pro Leu His
385 390 395 400

Leu Leu Gly Ser Leu Ile Lys Ser Met Lys Gln Asp His Phe Val Ser
405 410 415

Asp Thr Gly Asp Val Val Tyr Tyr Glu Ala Asp Pro Lys Leu Asn Gly
420 425 430

Gln Arg Thr
435



(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GAYATHHGNG CNGCNATHCC 20 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GCNATHCCNA ARCAYTG 17 
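
The synthetic oligonucleotides in this part of the listing are written with IUPAC ambiguity codes, so each entry stands for a pool of concrete sequences. The following is a minimal sketch of how the pool size (degeneracy) of such a primer can be computed and how the pool can be enumerated. The helper names `degeneracy` and `expand` are illustrative assumptions, not part of the listing.

```python
from itertools import product

# IUPAC nucleotide ambiguity codes and the bases each one stands for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer):
    """Number of distinct concrete sequences represented by a degenerate primer."""
    count = 1
    for code in primer.replace(" ", "").upper():
        count *= len(IUPAC[code])
    return count


def expand(primer):
    """Enumerate every concrete sequence in the pool (only practical for small pools)."""
    choices = [IUPAC[code] for code in primer.replace(" ", "").upper()]
    return ["".join(bases) for bases in product(*choices)]


# SEQ ID NO: 14 as printed above.
print(degeneracy("GCNATHCCNA ARCAYTG"))
# A short fragment of it, small enough to list in full.
print(expand("AARCAYTG"))
```

For SEQ ID NO: 14 the ambiguous positions are N, H, N, R and Y, so the pool size is 4 x 3 x 4 x 2 x 2.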
(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:
AARCAYTGYT GGGTNAA
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
TGGTTYYTNT GGCCNYTNTA YTGG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
TGGTTYYTNT GGCCN 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:
TGGCCNYTNT AYTGG 15
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:
TGGGTNGCNC ARGGNAC 17
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

TTYGTNYTNG GNCAYGA 17

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
GTNYTNGGNC AYGAYTG 17
(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
GGNCAYGAYT GYGGNCA
(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TGYGGNCAYG GNWSNTT

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CCNTAYCAYG GNTGG 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

CAYGGNTGGM GNATHWSNCA 20 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:
TGGMGNATHT CNCAYHGNAC NCAYCA 26
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:
TGGMGNATHA GYCAYHGNAC NCAYCA 26 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
TGGMGNATHW SNCAY 15
(2) INFORMATION FOR SEQ ID NO:29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

CAYMGNACNC AYCAY 15 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GARAAYGAYG ARWSNTGG 18
(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:
GAYGARWSNT GGGTNCC 17
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 






WO 94/18337



PCT/US94/01321 




(ii) MOLECULE TYPE: DNA (synthetic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

NGTNACNGCR TCNARCCA 18 
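
SEQ ID NO: 32 and the primers that follow it appear to be written on the antisense strand. A minimal sketch of the reverse-complement operation for sequences containing IUPAC ambiguity codes is given below. The sense-strand string used in the example, `TGGYTNGAYGCNGTNACN`, is an assumed back-translation of the peptide Trp Leu Asp Ala Val Thr (SEQ ID NO: 61) and is not itself printed in the listing; the function name is likewise illustrative.

```python
# Complement of each IUPAC code: A<->T, C<->G, R<->Y, K<->M, B<->V, D<->H;
# S, W and N are their own complements.
COMPLEMENT = str.maketrans("ACGTRYSWKMBDHVN", "TGCAYRSWMKVHDBN")


def reverse_complement(seq):
    """Reverse-complement a nucleotide sequence that may contain IUPAC codes."""
    seq = seq.replace(" ", "").upper()
    return seq.translate(COMPLEMENT)[::-1]


# Assumed degenerate sense strand for Trp Leu Asp Ala Val Thr.
sense = "TGGYTNGAYGCNGTNACN"
print(reverse_complement(sense))
```

Under that assumption the result can be compared directly with the 18-base sequence printed for SEQ ID NO: 32.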

(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
RTCRTGNARR TANGT 15 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
ARNCCNCCNC KNARRTARCT CCA 23 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
ARNCCNCCNC KNARRTANGA CCA 23 
(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (synthetic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
RTCNCKRTCD ATNGTNGTMA 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
RTARTCNCKR TCDATNGT 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
RTGNGTNCCD ATRTCRTG 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (synthetic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:
NARRTGRTGD ATNACRTG
(2) INFORMATION FOR SEQ ID NO:40:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:
DATYTGNGGR AANARRTGRT G 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:
GGDATYTGNG GRAANARRTG 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:
RTARTGNGGD ATYTGNGGRA ANA 
(2) INFORMATION FOR SEQ ID NO: 43:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:

Asp Ile Arg Ala Ala Ile Pro
1 5 

(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

Ala Ile Pro Lys His Cys
1 5 
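
The short peptides in this part of the listing can be related to the degenerate oligonucleotides earlier in it by back-translation. The following is a minimal sketch, assuming one degenerate codon per residue in the style the primers themselves appear to use (for example MGN for Arg, YTN for Leu, WSN for Ser). The table and the function name `back_translate` are illustrative assumptions rather than a procedure stated in the listing.

```python
# One degenerate codon per amino acid, written with IUPAC ambiguity codes.
# MGN, YTN and WSN are deliberately broad: each covers all six codons of
# Arg, Leu and Ser respectively, plus a few codons for other residues.
DEGENERATE_CODON = {
    "Ala": "GCN", "Arg": "MGN", "Asn": "AAY", "Asp": "GAY", "Cys": "TGY",
    "Gln": "CAR", "Glu": "GAR", "Gly": "GGN", "His": "CAY", "Ile": "ATH",
    "Leu": "YTN", "Lys": "AAR", "Met": "ATG", "Phe": "TTY", "Pro": "CCN",
    "Ser": "WSN", "Thr": "ACN", "Trp": "TGG", "Tyr": "TAY", "Val": "GTN",
}


def back_translate(peptide):
    """Back-translate a space-separated three-letter peptide into a degenerate DNA string."""
    return "".join(DEGENERATE_CODON[residue] for residue in peptide.split())


# The peptide of SEQ ID NO: 44.
full = back_translate("Ala Ile Pro Lys His Cys")
print(full)
# Dropping the final, fully degenerate wobble position gives a 17-mer.
print(full[:17])
```

The 17-base string produced this way matches the sequence printed for SEQ ID NO: 14, which suggests that the primers were derived from the peptides with the last wobble base omitted.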

(2) INFORMATION FOR SEQ ID NO:45:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Lys His Cys Trp Val Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

Trp Phe Leu Trp Pro Leu Tyr Trp
1 5

(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:

Trp Phe Leu Trp Pro
1 5

(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

Trp Pro Leu Tyr Trp
1 5

(2) INFORMATION FOR SEQ ID NO: 49:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:

Trp Val Ala Gln Gly Thr
1 5

(2) INFORMATION FOR SEQ ID NO: 50:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:

Trp Val Ala Gln Gly Thr
1 5



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:

Val Leu Gly His Asp Cys
1 5 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:

Gly His Asp Cys Gly His
1 5

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53:

Cys Gly His Gly Ser Phe
1 5



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 

Pro Tyr His Gly Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55:

His Gly Trp Arg Ile Ser His
1 5

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56:

Trp Arg Ile Ser His Arg Thr His His
1 5



(2) INFORMATION FOR SEQ ID NO: 57: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57:

Trp Arg Ile Ser His
1 5 



(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:

His Arg Thr His His
1 5 



(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:
Glu Asn Asp Glu Ser Trp



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60:

Asp Glu Ser Trp Val Pro

1 5 

(2) INFORMATION FOR SEQ ID NO: 61:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Trp Leu Asp Ala Val Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:

Thr Tyr Leu His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 8 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:

Trp Ser Tyr Leu Arg Gly Gly Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64:

Leu Thr Thr Ile Asp Arg Asp
1 5 

(2) INFORMATION FOR SEQ ID NO:65:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65:

Thr Ile Asp Arg Asp Tyr
1 5 

(2) INFORMATION FOR SEQ ID NO:66:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

His Asp Ile Gly Thr His
1 5 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:
His Val Ile His His Leu

(2) INFORMATION FOR SEQ ID NO:68:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

His His Leu Phe Pro Gln Ile
1 5 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

His Leu Phe Pro Gln Ile Pro
1 5 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:

Leu Phe Pro Gln Ile Pro His Tyr
1 5 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1670 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 46..1302

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:
CAAACTCTCT CGGGGGGTCG CTTCTTCTCC ATTTTCTGCT TCCCA ATG GCT TCC 54
Met Ala Ser
1

AGA ATT GCT GAT TCT CTC TTC GCC TTC ACG GGC CCA CAG CAA TGT CTT 102
Arg Ile Ala Asp Ser Leu Phe Ala Phe Thr Gly Pro Gln Gln Cys Leu
5 10 15

CCT AGG GTT CCT AAG CTT GCT GCT TCT TCT GCT CGT GTT TCT CCT GGT 150
Pro Arg Val Pro Lys Leu Ala Ala Ser Ser Ala Arg Val Ser Pro Gly
20 25 30 35

GTA TAT GCT GTG AAG CCG ATT GAT CTT CTG TTA AAA GGA CGA ACT CAT 198
Val Tyr Ala Val Lys Pro Ile Asp Leu Leu Leu Lys Gly Arg Thr His
40 45 50

CGA ACT AGA AGA TGT GTA GCT CCT GTG AAA AGG AGA ATT GGA TGT ATC 246
Arg Ser Arg Arg Cys Val Ala Pro Val Lys Arg Arg Ile Gly Cys Ile
55 60 65

AAA GCG GTG GCT GCT CCA GTT GCA CCG CCT TCA GCT GAC AGT GGA GAA 294
Lys Ala Val Ala Ala Pro Val Ala Pro Pro Ser Ala Asp Ser Ala Glu
70 75 80

GAC AGG GAA CAG TTA GCA GAA AGC TAT GGA TTC AGA CAA ATT GGA GAA 342
Asp Arg Glu Gln Leu Ala Glu Ser Tyr Gly Phe Arg Gln Ile Gly Glu
85 90 95

GAT CTT CCT GAG AAT GTC ACC TTA AAA GAT ATC ATG GAT ACA CTT CCC 390
Asp Leu Pro Glu Asn Val Thr Leu Lys Asp Ile Met Asp Thr Leu Pro
100 105 110 115

AAA GAG GTG TTT GAG ATT GAT GAT CTG AAA GCT TTG AAG TCT GTG TTG 438
Lys Glu Val Phe Glu Ile Asp Asp Leu Lys Ala Leu Lys Ser Val Leu
120 125 130

ATA TCT GTG ACT TCA TAC ACT TTG GGG CTC TTC ATG ATT GCA AAA TCG 486
Ile Ser Val Thr Ser Tyr Thr Leu Gly Leu Phe Met Ile Ala Lys Ser
135 140 145

CCG TGG TAT CTG CTA CCG TTG GCT TGG GCA TGG ACA GGA ACT GCA ATT 534
Pro Trp Tyr Leu Leu Pro Leu Ala Trp Ala Trp Thr Gly Thr Ala Ile
150 155 160

ACC GGG TTC TTT GTG ATA GGT CAT GAT TGT GGA CAT AAG TCA TTT TCA 582
Thr Gly Phe Phe Val Ile Gly His Asp Cys Ala His Lys Ser Phe Ser
165 170 175

AAG AAC AAA TTG GTG GAA GAC ATT GTG GGT ACT CTC GCC TTC CTA CCA 630
Lys Asn Lys Leu Val Glu Asp Ile Val Gly Thr Leu Ala Phe Leu Pro
180 185 190 195

CTT GTC TAC CCA TAT GAG CCA TGG CGG TTT AAG CAC GAC CGC CAT CAC 678
Leu Val Tyr Pro Tyr Glu Pro Trp Arg Phe Lys His Asp Arg His His
200 205 210

GCC AAA ACC AAC ATG TTA CTT CAT GAC ACA GCT TGG CAG CCA GTT CCG 726
Ala Lys Thr Asn Met Leu Leu His Asp Thr Ala Trp Gln Pro Val Pro
215 220 225

CCA GAG GAG TTT GAG TCA TCA CCC GTG ATG AGA AAG GCA ATC ATT TTT 774
Pro Glu Glu Phe Glu Ser Ser Pro Val Met Arg Lys Ala Ile Ile Phe
230 235 240

GGA TAT GGC CCA ATT AGA CCT TGG TTG TCC ATA GCT CAC TGG GTG AAC 822
Gly Tyr Gly Pro Ile Arg Pro Trp Leu Ser Ile Ala His Trp Val Asn
245 250 255

TGG CAC TTC AAT CTG AAA AAG TTC AGA GCG AGC GAG GTG AAT AGG GTG 870
Trp His Phe Asn Leu Lys Lys Phe Arg Ala Ser Glu Val Asn Arg Val
260 265 270 275

AAG ATA AGT TTG GCT TGT GTT TTC GCC TTC ATG GCC GTT GGG TGG CCA 918
Lys Ile Ser Leu Ala Cys Val Phe Ala Phe Met Ala Val Gly Trp Pro
280 285 290

CTG ATC GTA TAC AAA GTT GGT ATA TTG GGA TGG GTA AAA TTC TGG TTA 966
Leu Ile Val Tyr Lys Val Gly Ile Leu Gly Trp Val Lys Phe Trp Leu
295 300 305

ATG CCA TGG TTG GGC TAT CAC TTC TGG ATG AGC ACA TTC ACA ATG GTT 1014
Met Pro Trp Leu Gly Tyr His Phe Trp Met Ser Thr Phe Thr Met Val
310 315 320

CAT CAT ACG GCT CCG CAT ATA CCT TTC AAG CCT GCG GAT GAG TGG AAC 1062
His His Thr Ala Pro His Ile Pro Phe Lys Pro Ala Asp Glu Trp Asn
325 330 335

GCG GCT CAG GCC CAG CTG AAT GGA ACT GTT CAT TGT GAC TAC CCT AGT 1110
Ala Ala Gln Ala Gln Leu Asn Gly Thr Val His Cys Asp Tyr Pro Ser
340 345 350 355

TGG ATT GAA ATT CTC TGC CAT GAT ATC AAC GTT CAC ATC CCG CAT CAT 1158
Trp Ile Glu Ile Leu Cys His Asp Ile Asn Val His Ile Pro His His
360 365 370

ATT AGC CCA AGA ATA CCG AGC TAC AAT CTC CGT GCA GCT CAT GAG TCT 1206
Ile Ser Pro Arg Ile Pro Ser Tyr Asn Leu Arg Ala Ala His Glu Ser
375 380 385

ATA CAA GAG AAC TGG GGA AAG TAT ACA AAC TTG GCT ACA TGG AAC TGG 1254
Ile Gln Glu Asn Trp Gly Lys Tyr Thr Asn Leu Ala Thr Trp Asn Trp
390 395 400

CGA TTG ATG AAG ACG ATA ATG ACT GTG TGT CAT GTC TAT GAC AAA TAGGAGAACT 1309
Arg Leu Met Lys Thr Ile Met Thr Val Cys His Val Tyr Asp Lys
405 410 415

ACATTCCTTT TGACCGGTTA GCCCCTGAAG AATCTCAGCC AATAACCTTC CTCAAGAAAT 1369
CAATGCCTAA CTACACAGCC TGATTCGCCA TGGTCTCAAA CTAGTCTTTT GAAATCTCAA 1429
TATCTTTTTG CAGTCGCOGA TGTTATATGT AAGCTTTCCA AGCGATGAGC TTCTCTAACA 1489
CTTCACCAAC GCTTTATACT GTTATCTTCT TTCCAATCTT ATCAGAAGAG AGAAACTGGT 1549
CAAATTATCT GAGCGATTGC AATTCTTTTA TCAGTTTCTT AGCTATAAGA AGATTGAACA 1609
GTCTATATAG TTTGCAATGT ACTGTAATGT GATGAAAATT TAGTTGATGA GAAAAAAAAA 1669
A 1670
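
The coding-region coordinates given for SEQ ID NO: 71 (CDS at 46..1302) can be checked against the 418-residue length stated for SEQ ID NO: 72. The following is a minimal sketch in Python, assuming 1-based inclusive coordinates and a CDS feature that includes the terminating stop codon; the function name `cds_codon_count` is hypothetical and is not part of the listing.

```python
def cds_codon_count(start: int, end: int) -> int:
    """Return the number of codons in a CDS given 1-based inclusive coordinates."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


# Coordinates as given in the FEATURE block of SEQ ID NO: 71.
codons = cds_codon_count(46, 1302)
# Assumption: the last codon of the feature is the stop codon (TAG at 1300-1302).
residues = codons - 1
print(codons, residues)  # prints "419 418"
```

Under these assumptions the feature spans 419 codons, that is, 418 amino acids plus one stop codon, which agrees with the length recorded for SEQ ID NO: 72.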



(2) INFORMATION FOR SEQ ID NO: 72:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Met Ala Ser Arg Ile Ala Asp Ser Leu Phe Ala Phe Thr Gly Pro Gln
1 5 10 15

Gln Cys Leu Pro Arg Val Pro Lys Leu Ala Ala Ser Ser Ala Arg Val
20 25 30

Ser Pro Gly Val Tyr Ala Val Lys Pro Ile Asp Leu Leu Leu Lys Gly
35 40 45

Arg Thr His Arg Ser Arg Arg Cys Val Ala Pro Val Lys Arg Arg Ile
50 55 60

Gly Cys Ile Lys Ala Val Ala Ala Pro Val Ala Pro Pro Ser Ala Asp
65 70 75 80

Ser Ala Glu Asp Arg Glu Gln Leu Ala Glu Ser Tyr Gly Phe Arg Gln
85 90 95
Ile Gly Glu Asp Leu Pro Glu Asn Val Thr Leu Lys Asp Ile Met Asp
100 105 110

Thr Leu Pro Lys Glu Val Phe Glu Ile Asp Asp Leu Lys Ala Leu Lys
115 120 125

Ser Val Leu Ile Ser Val Thr Ser Tyr Thr Leu Gly Leu Phe Met Ile
130 135 140

Ala Lys Ser Pro Trp Tyr Leu Leu Pro Leu Ala Trp Ala Trp Thr Gly
145 150 155 160

Thr Ala Ile Thr Gly Phe Phe Val Ile Gly His Asp Cys Ala His Lys
165 170 175

Ser Phe Ser Lys Asn Lys Leu Val Glu Asp Ile Val Gly Thr Leu Ala
180 185 190

Phe Leu Pro Leu Val Tyr Pro Tyr Glu Pro Trp Arg Phe Lys His Asp
195 200 205

Arg His His Ala Lys Thr Asn Met Leu Leu His Asp Thr Ala Trp Gln
210 215 220

Pro Val Pro Pro Glu Glu Phe Glu Ser Ser Pro Val Met Arg Lys Ala
225 230 235 240

Ile Ile Phe Gly Tyr Gly Pro Ile Arg Pro Trp Leu Ser Ile Ala His
245 250 255

Trp Val Asn Trp His Phe Asn Leu Lys Lys Phe Arg Ala Ser Glu Val
260 265 270

Asn Arg Val Lys Ile Ser Leu Ala Cys Val Phe Ala Phe Met Ala Val
275 280 285

Gly Trp Pro Leu Ile Val Tyr Lys Val Gly Ile Leu Gly Trp Val Lys
290 295 300

Phe Trp Leu Met Pro Trp Leu Gly Tyr His Phe Trp Met Ser Thr Phe
305 310 315 320

Thr Met Val His His Thr Ala Pro His Ile Pro Phe Lys Pro Ala Asp
325 330 335

Glu Trp Asn Ala Ala Gln Ala Gln Leu Asn Gly Thr Val His Cys Asp
340 345 350

Tyr Pro Ser Trp Ile Glu Ile Leu Cys His Asp Ile Asn Val His Ile
355 360 365

Pro His His Ile Ser Pro Arg Ile Pro Ser Tyr Asn Leu Arg Ala Ala
370 375 380

His Glu Ser Ile Gln Glu Asn Trp Gly Lys Tyr Thr Asn Leu Ala Thr
385 390 395 400

Trp Asn Trp Arg Leu Met Lys Thr Ile Met Thr Val Cys His Val Tyr
405 410 415

Asp Lys

Claims:

1. A genetically transformed plant which has an elevated
linolenic acid content comprising a recombinant, double-stranded DNA
molecule comprising

(i) a promoter that functions in plant cells to cause
the production of an RNA sequence, said promoter
operably linked to;

(ii) a structural coding sequence that causes the
production of an RNA sequence that encodes a linoleic
acid desaturase activity; and

(iii) a 3' non-translated region that functions in plant
cells to promote polyadenylation to the 3' end of said RNA
sequence.

2. The plant of claim 1 in which the linoleic acid desaturase
activity is from plants.

3. The plant of claim 1 in which the linoleic acid desaturase
activity is from fungi, algae or bacteria.

4. The plant of claim 1 in which the structural coding
sequence of (ii) is taken from SEQ. ID NO:1.

5. The plant of claim 1 in which the structural coding
sequence of (ii) is taken from SEQ. ID NO:9.

6. The plant of claim 1 in which the structural coding
sequence of (ii) is taken from SEQ. ID NO:11.

7. The plant of claim 1 in which the promoter of (i) is an
endogenous plant linoleic acid desaturase promoter.

8. A genetically transformed plant which has a reduced
linolenic acid content, comprising a recombinant, double-stranded DNA
molecule comprising

(i) a promoter that functions in plant cells to cause
the production of an RNA sequence, said promoter
operably linked to;

(ii) a DNA sequence that causes the production of an
RNA sequence that is in antisense orientation to at least
a portion of a gene that encodes a linoleic acid desaturase
activity in said plant; and

(iii) a 3' non-translated region that functions in plant
cells to promote polyadenylation to the 3' end of said RNA
sequence.

9. The plant of claim 8 in which the linoleic acid desaturase
enzyme is from plants.

10. The plant of claim 8 in which the linoleic acid desaturase
enzyme is from fungi, algae or bacteria.

11. The plant of claim 8 in which the structural coding
sequence of (ii) is taken from SEQ. ID NO:1.

12. The plant of claim 8 in which the structural coding
sequence of (ii) is taken from SEQ. ID NO:9.

13. The plant of claim 8 in which the structural coding
sequence of (ii) is taken from SEQ. ID NO:11.

14. The plant of claim 8 in which the promoter of (i) is an
endogenous plant linoleic acid desaturase promoter.

15. A genetically transformed plant which has an improved
resistance to low temperatures comprising a recombinant, double-stranded
DNA molecule comprising

(i) a promoter that functions in plant cells to cause
the production of an RNA sequence, said promoter
operably linked to;

(ii) a structural coding sequence that causes the
production of an RNA sequence that encodes a linoleic
acid desaturase activity; and

(iii) a 3' non-translated region that functions in plant
cells to promote polyadenylation to the 3' end of said RNA
sequence.

16. A genetically transformed plant which has an elevated
ability to respond to pathogens, comprising a recombinant, double-stranded
DNA molecule comprising

(i) a promoter that functions in plant cells to cause
the production of an RNA sequence, said promoter
operably linked to;

(ii) a structural coding sequence that causes the
production of an RNA sequence that encodes a linoleic
acid desaturase activity; and

(iii) a 3' non-translated region that functions in plant
cells to promote polyadenylation to the 3' end of said RNA
sequence.

17. A seed produced from a genetically transformed plant where
said seed has a linolenic acid content suitable for use as a source of
linolenic acid, said plant comprising a recombinant, double-stranded DNA
molecule comprising

(i) a promoter that functions in plant cells to cause
the production of an RNA sequence, said promoter
operably linked to;

(ii) a structural coding sequence that causes the
production of an RNA sequence that encodes a linoleic
acid desaturase activity; and

(iii) a 3' non-translated region that functions in plant
cells to promote polyadenylation to the 3' end of said RNA
sequence.

18. The seed of claim 17 where said plant is selected from the
group consisting of soybean and rapeseed.

19. A genetically transformed plant which has a linolenic acid
content of less than about 3%, said plant comprising a recombinant,
double-stranded DNA molecule comprising

(i) a promoter that functions in plant cells to cause
the production of an RNA sequence, said promoter
operably linked to;

(ii) a DNA sequence that causes the production of an
RNA sequence that is in antisense orientation to at least
a portion of a gene that encodes a linoleic acid desaturase
activity in said plant; and

(iii) a 3' non-translated region that functions in plant
cells to promote polyadenylation to the 3' end of said RNA
sequence.

20. A genetically transformed plant which has an increased
oleic acid content, comprising a recombinant, double-stranded DNA
molecule comprising

(i) a promoter that functions in plant cells to cause
the production of an RNA sequence, said promoter
operably linked to;

(ii) a DNA sequence that causes the production of an
RNA sequence that is in antisense orientation to at least
a portion of a gene that encodes an oleic acid desaturase
activity in said plant; and

(iii) a 3' non-translated region that functions in plant
cells to promote polyadenylation to the 3' end of said RNA
sequence.

21. A genetically transformed plant which has an increased
oleic acid content, comprising a recombinant, double-stranded DNA
molecule comprising

(i) a promoter that functions in plant cells to cause
the production of an RNA sequence, said promoter
operably linked to;

(ii) a DNA sequence that causes the production of an
RNA sequence that is in antisense orientation to at least
a portion of a gene that encodes a linoleic acid desaturase
activity in said plant; and

(iii) a 3' non-translated region that functions in plant
cells to promote polyadenylation to the 3' end of said RNA
sequence.

22. A method of producing a genetically transformed plant
which has an elevated linolenic acid content, comprising

(a) inserting into the genome of a plant cell a
recombinant, double-stranded DNA molecule comprising:

(i) a promoter that functions in plant cells to
cause the production of an RNA sequence, said
promoter operably linked to;

(ii) a structural coding sequence that causes
the production of an RNA sequence that encodes
a linoleic acid desaturase activity; and

(iii) a 3' non-translated region that functions in
plant cells to promote polyadenylation to the 3'
end of said RNA sequence;

(b) obtaining transformed plant cells; and

(c) regenerating from the transformed plant cells
genetically transformed plants which have an elevated
linolenic acid content.

23. The method of claim 22 in which the linoleic acid
desaturase enzyme is from plants.

24. The method of claim 22 in which the linoleic acid
desaturase enzyme is from fungi, algae or bacteria.

25. The method of claim 22 in which the structural coding
sequence of (ii) is taken from SEQ. ID NO:1.

26. The method of claim 22 in which the structural coding
sequence of (ii) is taken from SEQ. ID NO:9.

27. The method of claim 22 in which the structural coding
sequence of (ii) is taken from SEQ. ID NO:11.

28. The plant of claim 22 in which the promoter of (i) is an
endogenous plant linoleic acid desaturase promoter.

29. A method of producing a genetically transformed plant
which has a reduced linolenic acid content, comprising

(a) inserting into the genome of a plant cell a
recombinant, double-stranded DNA molecule comprising:

(i) a promoter that functions in plant cells to
cause the production of an RNA sequence, said
promoter operably linked to;

(ii) a DNA sequence that causes the
production of an RNA sequence that is in
antisense orientation to at least a portion of a
gene that encodes a linoleic acid desaturase
activity in said plant; and

(iii) a 3' non-translated region that functions in
plant cells to promote polyadenylation to the 3'
end of said RNA sequence;

(b) obtaining transformed plant cells; and

(c) regenerating from the transformed plant cells
genetically transformed plants which have a reduced
linolenic acid content.

30. The method of claim 29 in which the linoleic acid
desaturase enzyme is from plants.

31. The method of claim 29 in which the linoleic acid
desaturase enzyme is from fungi, algae or bacteria.

32. The method of claim 29 in which the structural coding
sequence of (ii) is taken from SEQ. ID NO:1.

33. The method of claim 29 in which the structural coding
sequence of (ii) is taken from SEQ. ID NO:9.

34. The method of claim 29 in which the structural coding
sequence of (ii) is taken from SEQ. ID NO:11.

35. The plant of claim 29 in which the promoter of (i) is an
endogenous plant linoleic acid desaturase promoter.

36. A method of producing a genetically transformed plant
which has an increased oleic acid content, comprising

(a) inserting into the genome of a plant cell a
recombinant, double-stranded DNA molecule comprising:

(i) a promoter that functions in plant cells to
cause the production of an RNA sequence, said
promoter operably linked to;

(ii) a DNA sequence that causes the
production of an RNA sequence that is in
antisense orientation to at least a portion of a
gene that encodes a linoleic acid desaturase
activity in said plant; and

(iii) a 3' non-translated region that functions in
plant cells to promote polyadenylation to the 3'
end of said RNA sequence;

(b) obtaining transformed plant cells; and

(c) regenerating from the transformed plant cells
genetically transformed plants which have an increased
oleic acid content.

37. A recombinant, double-stranded DNA molecule
comprising in sequence:

(i) a promoter that functions in plant cells to cause
the production of an RNA sequence, said promoter
operably linked to;

(ii) a structural coding sequence that causes the
production of an RNA sequence that encodes a linoleic
acid desaturase activity; and

(iii) a 3' non-translated region that functions in plant
cells to promote polyadenylation to the 3' end of said RNA
sequence.

38. A recombinant, double-stranded DNA molecule
comprising in sequence:

(i) a promoter that functions in plant cells to cause
the production of an RNA sequence, said promoter
operably linked to;

(ii) a DNA sequence that causes the production of an
RNA sequence that is in antisense orientation to at least
a portion of a gene that encodes a linoleic acid desaturase
activity in said plant; and

(iii) a 3' non-translated region that functions in plant
cells to promote polyadenylation to the 3' end of said RNA
sequence.

39. A plant cell comprising a recombinant, double-stranded
DNA molecule comprising in sequence:

(i) a promoter that functions in plant cells to cause
the production of an RNA sequence, said promoter
operably linked to;

(ii) a DNA sequence that causes the production of an
RNA sequence that is in antisense orientation to at least
a portion of a gene that encodes a linoleic acid desaturase
activity in said plant; and

(iii) a 3' non-translated region that functions in plant
cells to promote polyadenylation to the 3' end of said RNA
sequence.

40. A method of producing a genetically transformed plant
which has an increased oleic acid content, comprising

(a) inserting into the genome of a plant cell a
recombinant, double-stranded DNA molecule comprising:

(i) a promoter that functions in plant cells to
cause the production of an RNA sequence, said
promoter operably linked to;

(ii) a DNA sequence that causes the
production of an RNA sequence that is in
antisense orientation to at least a portion of a
gene that encodes an oleic acid desaturase activity
in said plant; and



WO 94/18337 PCT/US94/01321



(iii) a 3' non-translated region that functions in
plant cells to promote polyadenylation to the 3'
end of said RNA sequence;

(b) obtaining transformed plant cells; and 

(c) regenerating from the transformed plant cells 
genetically transformed plants which have an increased 
oleic acid content. 
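
Several of the claims above recite an RNA sequence "in antisense orientation" to a portion of a desaturase gene. As an illustration only, the sketch below shows how the antisense RNA corresponding to a coding-strand DNA fragment can be derived in silico by taking the reverse complement and writing it as RNA. The helper names `reverse_complement` and `antisense_rna` are hypothetical; the nine-nucleotide fragment is simply the first three codons of SEQ ID NO: 71 and is not a construct described in this document.

```python
# Translation table for complementing DNA bases (upper and lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def antisense_rna(coding_fragment: str) -> str:
    """Return the RNA complementary to the mRNA of a coding-strand fragment.

    The result is written 5'->3' with U in place of T.
    """
    return reverse_complement(coding_fragment).replace("T", "U").replace("t", "u")


# Illustrative fragment: first three codons (Met Ala Ser) of SEQ ID NO: 71.
fragment = "ATGGCTTCC"
print(reverse_complement(fragment))  # GGAAGCCAT
print(antisense_rna(fragment))       # GGAAGCCAU
```

In a construct of the kind claimed, this orientation would be obtained by placing the gene fragment in reverse relative to the promoter; the code only illustrates the sequence relationship, not a cloning procedure.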
MTCCATCAA ACCTTTATTC ACCACATTTC ACTGAAAGGC CACACATCTA GAGAGAGAAA 60

CTTCGTCCAA ATCTCTCTCT CCAGCC ATG GTT GTT GCT ATG GAG GAG CGC AGC 113
Met Val Val Ala Met Asp Gln Arg Ser
1 5

AAT GTT AAC GGA GAT TCC GGT GCC CGG AAG GAA GAA GGG TTT GAT CCA 161
Asn Val Asn Gly Asp Ser Gly Ala Arg Lys Glu Glu Gly Phe Asp Pro
10 15 20 25

AGC GCA CAA CCA CCG TTT AAG ATG GGA GAT ATA AGG GCG GCG ATT CCT 209
Ser Ala Gln Pro Pro Phe Lys Ile Gly Asp Ile Arg Ala Ala Ile Pro
30 35 40

AAG CAT TGC TGG GTG AAG AGT CCT TTG AGA TCT ATG AGC TAC GTC ACC 257
Lys His Cys Trp Val Lys Ser Pro Leu Arg Ser Met Ser Tyr Val Thr
45 50 55

AGA GAC ATT TTC GCC GTC GCG GCT CTG GCC ATG GCC GCC GTG TAT TTT 305
Arg Asp Ile Phe Ala Val Ala Ala Leu Ala Met Ala Ala Val Tyr Phe
60 65 70

GAT AGC TGG TTC CTG TGG CCA GTC TAC TGG GTT GCC CAA GGA ACC GTT 353
Asp Ser Trp Phe Leu Trp Pro Leu Tyr Trp Val Ala Gln Gly Thr Leu
75 80 85

TTC TGG GCC ATC TTC GTT CTT GGC CAC GAC TGT GGA CAT GGG AGT TTC 401
Phe Trp Ala Ile Phe Val Leu Gly His Asp Cys Gly His Gly Ser Phe
90 95 100 105

TCA GAC ATT CCT CTG CTG AAC AGT GTG GTT GGT CAC ATT CTT CAT TCA 449
Ser Asp Ile Pro Leu Leu Asn Ser Val Val Gly His Ile Leu His Ser
110 115 120

TTC ATC GTC GTT CCT TAC CAT GGT TGG AGA ATA AGC CAT CGG ACA CAC 497
Phe Ile Leu Val Pro Tyr His Gly Trp Arg Ile Ser His Arg Thr His
125 130 135

CAC GAG AAC CAT GGC CAT GTT GAA AAC GAC GAG TCT TGG GTT CCG TTG 545
His Gln Asn His Gly His Val Glu Asn Asp Glu Ser Trp Val Pro Leu
140 145 150
CCA GAA AAG TTG TAC AAG AAC TTG CCC CAT ACT ACT CGG ATG CTC AGA 593
Pro Glu Lys Leu Tyr Lys Asn Leu Pro His Ser Thr Arg Met Leu Arg
155 160 165

TAC ACT GTC CCT CTG CCC ATG CTC GCT TAC CCG ATC TAT CTG TGG TAC 641
Tyr Thr Val Pro Leu Pro Met Leu Ala Tyr Pro Ile Tyr Leu Trp Tyr
170 175 180 185

AGA AGT CCT GGA AAA GAA GGG TCA CAT TTT AAC CCA TAC ACT AGT TTA 689
Arg Ser Pro Gly Lys Glu Gly Ser His Phe Asn Pro Tyr Ser Ser Leu
190 195 200

TTT GCT CCA AGC GAG AGG AAG CTT ATT GCA ACT TCA ACT ACT TGC TGG 737
Phe Ala Pro Ser Glu Arg Lys Leu Ile Ala Thr Ser Thr Thr Cys Trp
205 210 215

TCC ATA ATG TTG GCC ACT CTT GTT TAT CTA TCG TTC CTC GTT GAT CCA 785
Ser Ile Met Leu Ala Thr Leu Val Tyr Leu Ser Phe Leu Val Asp Pro
220 225 230

GTC ACA GTT CTC AAA GTC TAT GGC GTT CCT TAC ATT ATC TTT GTG ATG 833
Val Thr Val Leu Lys Val Tyr Gly Val Pro Tyr Ile Ile Phe Val Met
235 240 245

TGG TTG GAC GCT GTC ACG TAC TTG CAT CAT CAT GGT CAC GAT GAG AAG 881
Trp Leu Asp Ala Val Thr Tyr Leu His His His Gly His Asp Glu Lys
250 255 260 265

TTG CCT TGG TAC AGA GGC AAG GAA TGG AGT TAT TTA CGT GGA GGA TTA 929
Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu
270 275 280

ACA ACT ATT GAT AGA GAT TAC GGA ATC TTC AAC AAC ATC CAT CAC GAC 977
Thr Thr Ile Asp Arg Asp Tyr Gly Ile Phe Asn Asn Ile His His Asp
285 290 295

ATT GGA ACT CAC GTG ATC CAT CAT CTT TTC CCA CAA ATC CCT CAC TAT 1025
Ile Gly Thr His Val Ile His His Leu Phe Pro Gln Ile Pro His Tyr
300 305 310
CAC TTG GTC GAT GCC ACG AGA GCA GCT AAA CAT GTG TTA GGA AGA TAC 1073
His Leu Val Asp Ala Thr Arg Ala Ala Lys His Val Leu Gly Arg Tyr
315 320 325

TAC AGA GAG CCG AAG ACG TCA GGA GCA ATA CCG ATT CAC TTG GTG GAG 1121
Tyr Arg Glu Pro Lys Thr Ser Gly Ala Ile Pro Ile His Leu Val Glu
330 335 340 345

ACT TTG GTC GCA AGT ATT AAA AAA GAT CAT TAC GTC AGT GAG ACT GGT 1169
Ser Leu Val Ala Ser Ile Lys Lys Asp His Tyr Val Ser Asp Thr Gly
350 355 360

GAT ATT GTC TTC TAC GAG ACA GAT CCA GAT CTC TAC GTT TAT GCT TCT 1217
Asp Ile Val Phe Tyr Glu Thr Asp Pro Asp Leu Tyr Val Tyr Ala Ser
365 370 375

GAC AAA TCT AAA ATC AAT TAACTTTTCT TCCTAGCTCT ATTAGGAATA 1265
Asp Lys Ser Lys Ile Asn
380

AACACTCCTT CTCTTTTACT TATTTGTTTC TGCTTTAAGT TTAAAATGTA CTCGTGAAAC 1325 
CTTTTTTTTA TTAATGTATT TACGTTAC 1353 



FIG.3C 
Met Val Val Ala Met Asp Gln Arg Ser Asn Val Asn Gly Asp Ser Gly
1 5 10 15

Ala Arg Lys Glu Glu Gly Phe Asp Pro Ser Ala Gln Pro Pro Phe Lys
20 25 30

Ile Gly Asp Ile Arg Ala Ala Ile Pro Lys His Cys Trp Val Lys Ser
35 40 45

Pro Leu Arg Ser Met Ser Tyr Val Thr Arg Asp Ile Phe Ala Val Ala
50 55 60

Ala Leu Ala Met Ala Ala Val Tyr Phe Asp Ser Trp Phe Leu Trp Pro
65 70 75 80

Leu Tyr Trp Val Ala Gln Gly Thr Leu Phe Trp Ala Ile Phe Val Leu
85 90 95

Gly His Asp Cys Gly His Gly Ser Phe Ser Asp Ile Pro Leu Leu Asn
100 105 110

Ser Val Val Gly His Ile Leu His Ser Phe Ile Leu Val Pro Tyr His
115 120 125

Gly Trp Arg Ile Ser His Arg Thr His His Gln Asn His Gly His Val
130 135 140

Glu Asn Asp Glu Ser Trp Val Pro Leu Pro Glu Lys Leu Tyr Lys Asn
145 150 155 160

Leu Pro His Ser Thr Arg Met Leu Arg Tyr Thr Val Pro Leu Pro Met
165 170 175

Leu Ala Tyr Pro Ile Tyr Leu Trp Tyr Arg Ser Pro Gly Lys Glu Gly
180 185 190

Ser His Phe Asn Pro Tyr Ser Ser Leu Phe Ala Pro Ser Glu Arg Lys
195 200 205

Leu Ile Ala Thr Ser Thr Thr Cys Trp Ser Ile Met Leu Ala Thr Leu
210 215 220

Val Tyr Leu Ser Phe Leu Val Asp Pro Val Thr Val Leu Lys Val Tyr
225 230 235 240

Gly Val Pro Tyr Ile Ile Phe Val Met Trp Leu Asp Ala Val Thr Tyr
245 250 255

Leu His His His Gly His Asp Glu Lys Leu Pro Trp Tyr Arg Gly Lys
260 265 270

Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr Ile Asp Arg Asp Tyr
275 280 285

Gly Ile Phe Asn Asn Ile His His Asp Ile Gly Thr His Val Ile His
290 295 300

His Leu Phe Pro Gln Ile Pro His Tyr His Leu Val Asp Ala Thr Arg
305 310 315 320

Ala Ala Lys His Val Leu Gly Arg Tyr Tyr Arg Glu Pro Lys Thr Ser
325 330 335

Gly Ala Ile Pro Ile His Leu Val Glu Ser Leu Val Ala Ser Ile Lys
340 345 350

Lys Asp His Tyr Val Ser Asp Thr Gly Asp Ile Val Phe Tyr Glu Thr
355 360 365

Asp Pro Asp Leu Tyr Val Tyr Ala Ser Asp Lys Ser Lys Ile Asn
370 375 380

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10 20 30 40 50 60 

BND3 . AMI RSNVNGDSGARKEEGFOPSAQPPFK IGD I RAA I PKHCWVKSPLRSMSYVTRO I F AVAAL A 



DESA . AMI MTAT I PPLTPTVTPSNPDRP I ADLKLQD 1 1 KTLPKECFEKKASKAWASVL I TLGA I AVGY 

10 20 30 40 50 60 

70 80 90 100 110 120 

BND3 . AM I MAAVYFOSWF LWPL YWVAQGTLFWA I F VLGHDCGHGSFSD I PLLNSWGH I LHSF I L VPY 

DESA .AMI LG I i YL-PWYCLP i TwiwTGTALTGAFWGHDCGHRSFAKKRWVNDL VGH I AFAPL I YPF 

70 80 90 100 110 

130 140 150 160 170 180 

BN03 . AM I HGWR I SHRTHHQNHGHVENDESWVPLPEKL YKNLPHSTRMLRYTVPL PH-LAYP I YLWYR 
DESA . AMI L VYHFWMSTFT I VHHT I PE I RF-H?PAADWSAAEAaNGTVHCDYPRWVEVLCHD I NVH i

310 320 330 340 350 360
BND3 . AM 1 I HHLFPO I PHYHL VDATRAAKHVLGRYYREPKTSGAI P I HLVESL VAS I KKDHYVSDTGD 

GGAAAACACA AGTTTCTCTC ACACACATTA TCTCTTTCTC TATTACCACC ACTCATTCAT 60
AACAGAAACC CACCAAAAAA TAAAAAGAGA GACTTTTCAC TCTGGGGAGA GAGCTCAAGT 120

TCTA ATG GCG AAC TTG GTC TTA TCA GAA TGT GGT ATA CGA CCT CTC CCG 169
Met Ala Asn Leu Val Leu Ser Glu Cys Gly Ile Arg Pro Leu Pro
1 5 10 15

AGA ATG TAC ACA AGA CCG AGA TCC AAT TTG CTG TCC AAC AAG AAC AAA 217
Arg Ile Tyr Thr Thr Pro Arg Ser Asn Phe Leu Ser Asn Asn Asn Lys
20 25 30

TTC AGA CCA TCA CTT TCT TCT TCT TCT TAC AAA ACA TCA TCA TGT CCT 265
Phe Arg Pro Ser Leu Ser Ser Ser Ser Tyr Lys Thr Ser Ser Ser Pro
35 40 45

CTG TCT TTT GGT CTG AAT TCA CGA GAT GGG TTC ACG AGG AAT TGG GCG 313
Leu Ser Phe Gly Leu Asn Ser Arg Asp Gly Phe Thr Arg Asn Trp Ala
50 55 60

TTG AAT GTG AGC ACA CCA TTA ACG ACA CCA ATA TTT GAG GAG TCT CCA 361
Leu Asn Val Ser Thr Pro Leu Thr Thr Pro Ile Phe Glu Glu Ser Pro
65 70 75

TTG GAG GAA GAT AAT AAA GAG AGA TTC GAT CCA GGT GCG GGT CCT CCG 409
Leu Glu Glu Asp Asn Lys Gln Arg Phe Asp Pro Gly Ala Pro Pro Pro
80 85 90 95

TTC AAT TTA GCT GAT ATT AGA GCA GGT ATA CCT AAG CAT TGT TGG GTT 457
Phe Asn Leu Ala Asp Ile Arg Ala Ala Ile Pro Lys His Cys Trp Val
100 105 110

AAG AAT CCA TGG AAG TCT TTG AGT TAT GTC GTC AGA GAG GTC GCT ATG 505
Lys Asn Pro Trp Lys Ser Leu Ser Tyr Val Val Arg Asp Val Ala Ile
115 120 125

GTC TTT GCA TTG GCT GCT GGA GCT GCT TAC CTC AAC AAT TGG ATT GTT 553
Val Phe Ala Leu Ala Ala Gly Ala Ala Tyr Leu Asn Asn Trp Ile Val
130 135 140
130 135 140 
TGG CCT CTC TAT TGG CTC GCT CAA GGA ACC ATG TTT TGG GCT CTC TTT 601
Trp Pro Leu Tyr Trp Leu Ala Gln Gly Thr Met Phe Trp Ala Leu Phe
145 150 155

GTT CTT GGT CAT GAC TGT GGA CAT GGT AGT TTC TCA AAT GAT CCG AAG 649
Val Leu Gly His Asp Cys Gly His Gly Ser Phe Ser Asn Asp Pro Lys
160 165 170 175

TTG AAC AGT GTG GTC GGT CAT CTT CTT CAT TCC TCA ATT CTG GTC CCA 697
Leu Asn Ser Val Val Gly His Leu Leu His Ser Ser Ile Leu Val Pro
180 185 190

TAC CAT GGC TGG AGA ATT AGT CAC AGA ACT CAC CAC CAG AAC CAT GGA 745
Tyr His Gly Trp Arg Ile Ser His Arg Thr His His Gln Asn His Gly
195 200 205

CAT GTT GAG AAT GAC GAA TCT TGG CAT CCT ATG TCT GAG AAA ATC TAC 793
His Val Glu Asn Asp Glu Ser Trp His Pro Met Ser Glu Lys Ile Tyr
210 215 220

AAT ACT TTG GAC AAG CCG ACT AGA TTC TTT AGA TTT ACA CTG CCT CTC 841
Asn Thr Leu Asp Lys Pro Thr Arg Phe Phe Arg Phe Thr Leu Pro Leu
225 230 235

GTG ATG CTT GCA TAC CCT TTC TAC TTG TGG GCT CGA AGT CCG GGG AAA 889
Val Met Leu Ala Tyr Pro Phe Tyr Leu Trp Ala Arg Ser Pro Gly Lys
240 245 250 255

AAG GGT TCT CAT TAC CAT CCA GAC AGT GAC TTG TTC CTC CCT AAA GAG 937
Lys Gly Ser His Tyr His Pro Asp Ser Asp Leu Phe Leu Pro Lys Glu
260 265 270

AGA AAG GAT GTC CTC ACT TCT ACT GCT TGT TGG ACT GCA ATG GCT GCT 985
Arg Lys Asp Val Leu Thr Ser Thr Ala Cys Trp Thr Ala Met Ala Ala
275 280 285

CTG CTT GTT TGT CTC AAC TTC ACA ATC GGT CCA ATT CAA ATG CTC AAA 1033
Leu Leu Val Cys Leu Asn Phe Thr Ile Gly Pro Ile Gln Met Leu Lys
290 295 300
CTT TAT GGA ATT CCT TAC TGG ATA AAT GTA ATG TGG TTG GAC TTT GTG 1081
Leu Tyr Gly Ile Pro Tyr Trp Ile Asn Val Met Trp Leu Asp Phe Val
305 310 315

ACT TAC CTG CAT CAC CAT GGT CAT GAA GAT AAG CTT CCT TGG TAC CGT 1129
Thr Tyr Leu His His His Gly His Glu Asp Lys Leu Pro Trp Tyr Arg
320 325 330 335

GGC AAG GAG TGG AGT TAC CTG AGA GGA GGA CTT ACA ACA TTG GAT CGT 1177
Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr Leu Asp Arg
340 345 350

GAC TAC GGA TTG ATC AAT AAC ATC CAT CAT GAT ATT GGA ACT CAT GTG 1225
Asp Tyr Gly Leu Ile Asn Asn Ile His His Asp Ile Gly Thr His Val
355 360 365

ATA CAT CAT CTT TTC CCG CAG ATC CCA CAT TAT CAT CTA GTA GAA GCA 1273
Ile His His Leu Phe Pro Gln Ile Pro His Tyr His Leu Val Glu Ala
370 375 380

ACA GAA GCA GCT AAA CCA GTA TTA GGG AAG TAT TAC AGG GAG CCT GAT 1321
Thr Glu Ala Ala Lys Pro Val Leu Gly Lys Tyr Tyr Arg Glu Pro Asp
385 390 395

AAG TCT GGA CCG TTG CCA TTA CAT TTA CTG GAA ATT CTA GCG AAA AGT 1369
Lys Ser Gly Pro Leu Pro Leu His Leu Leu Glu Ile Leu Ala Lys Ser
400 405 410 415

ATA AAA GAA GAT CAT TAC GTG AGC GAC GAA GGA GAA GTT GTA TAC TAT 1417
Ile Lys Glu Asp His Tyr Val Ser Asp Glu Gly Glu Val Val Tyr Tyr
420 425 430

AAA GCA GAT CCA AAT CTC TAT GGA GAG GTC AAA GTA AGA GCA GAT TGAAATGAAG 1472
Lys Ala Asp Pro Asn Leu Tyr Gly Glu Val Lys Val Arg Ala Asp
435 440 445

CAGGCTTGAG ATTGAAGTTT TTTCTATTTC AGACCAGCTG ATTTTTTGCT TACTGTATCA 1532
ATTTATTGTG TCACCCACCA GAGAGTTAGT ATCTCTGAAT ACGATCGATC AGATGGAAAC 1592
AACAAATTTG TTTGCGATAC TGAAGCTATA TATACCATAA AAAAAAAAAA AAA 1645

Met Ala Asn Leu Val Leu Ser Glu Cys Gly Ile Arg Pro Leu Pro Arg
1 5 10 15

Ile Tyr Thr Thr Pro Arg Ser Asn Phe Leu Ser Asn Asn Asn Lys Phe
20 25 30

Arg Pro Ser Leu Ser Ser Ser Ser Tyr Lys Thr Ser Ser Ser Pro Leu
35 40 45

Ser Phe Gly Leu Asn Ser Arg Asp Gly Phe Thr Arg Asn Trp Ala Leu
50 55 60

Asn Val Ser Thr Pro Leu Thr Thr Pro Ile Phe Glu Glu Ser Pro Leu
65 70 75 80

Glu Glu Asp Asn Lys Gln Arg Phe Asp Pro Gly Ala Pro Pro Pro Phe
85 90 95

Asn Leu Ala Asp Ile Arg Ala Ala Ile Pro Lys His Cys Trp Val Lys
100 105 110

Asn Pro Trp Lys Ser Leu Ser Tyr Val Val Arg Asp Val Ala Ile Val
115 120 125

Phe Ala Leu Ala Ala Gly Ala Ala Tyr Leu Asn Asn Trp Ile Val Trp
130 135 140

Pro Leu Tyr Trp Leu Ala Gln Gly Thr Met Phe Trp Ala Leu Phe Val
145 150 155 160

Leu Gly His Asp Cys Gly His Gly Ser Phe Ser Asn Asp Pro Lys Leu
165 170 175

Asn Ser Val Val Gly His Leu Leu His Ser Ser Ile Leu Val Pro Tyr
180 185 190

His Gly Trp Arg Ile Ser His Arg Thr His His Gln Asn His Gly His
195 200 205

Val Glu Asn Asp Glu Ser Trp His Pro Met Ser Glu Lys Ile Tyr Asn
210 215 220

Thr Leu Asp Lys Pro Thr Arg Phe Phe Arg Phe Thr Leu Pro Leu Val
225 230 235 240

Met Leu Ala Tyr Pro Phe Tyr Leu Trp Ala Arg Ser Pro Gly Lys Lys
245 250 255

Gly Ser His Tyr His Pro Asp Ser Asp Leu Phe Leu Pro Lys Glu Arg
260 265 270

Lys Asp Val Leu Thr Ser Thr Ala Cys Trp Thr Ala Met Ala Ala Leu
275 280 285

Leu Val Cys Leu Asn Phe Thr Ile Gly Pro Ile Gln Met Leu Lys Leu
290 295 300

Tyr Gly Ile Pro Tyr Trp Ile Asn Val Met Trp Leu Asp Phe Val Thr
305 310 315 320

Tyr Leu His His His Gly His Glu Asp Lys Leu Pro Trp Tyr Arg Gly
325 330 335

Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr Leu Asp Arg Asp
340 345 350

Tyr Gly Leu Ile Asn Asn Ile His His Asp Ile Gly Thr His Val Ile
355 360 365

His His Leu Phe Pro Gln Ile Pro His Tyr His Leu Val Glu Ala Thr
370 375 380

Glu Ala Ala Lys Pro Val Leu Gly Lys Tyr Tyr Arg Glu Pro Asp Lys
385 390 395 400

Ser Gly Pro Leu Pro Leu His Leu Leu Glu Ile Leu Ala Lys Ser Ile
405 410 415

Lys Glu Asp His Tyr Val Ser Asp Glu Gly Glu Val Val Tyr Tyr Lys
420 425 430

Ala Asp Pro Asn Leu Tyr Gly Glu Val Lys Val Arg Ala Asp
435 440 445

AGAGAGTGCA AATAGAACGA CAGAGACTTT TTCCTCTTTT CTTCTTGGGA AGAGGCTCCA 60

ATG GCG AGC TCG GTT TTA TCA GAA TGT GGT TTT AGA CCT CTC CCC AGA 108
Met Ala Ser Ser Val Leu Ser Glu Cys Gly Phe Arg Pro Leu Pro Arg
1 5 10 15

TTC TAG CCT AAA GAG AGA ACC TGT TTT GGC TCT AAC CCT AAA CCC ACT 156
Phe Tyr Pro Lys His Thr Thr Ser Phe Ala Ser Asn Pro Lys Pro Thr
20 25 30

TTG AAA TTC AAT CCA CCA GTT AAA CCT CCT TCT TCT CTT CTC AAT TCC 204
Phe Lys Phe Asn Pro Pro Leu Lys Pro Pro Ser Ser Leu Leu Asn Ser
35 40 45

CGA TAT GGA TTC TAG TCT AAA ACC AGG AAC TGG GCA TTG AAT GTG GCA 252
Arg Tyr Gly Phe Tyr Ser Lys Thr Arg Asn Trp Ala Leu Asn Val Ala
50 55 60

ACA CCT TTA ACA ACT CTT CAG TCT CCA TCC GAG GAA GAG AGG GAG AGA 300
Thr Pro Leu Thr Thr Leu Gln Ser Pro Ser Glu Glu Asp Thr Glu Arg
65 70 75 80

TTC GAG CCA GGT GCG CCT CCT CCC TTC AAT TTG GCG GAT ATA AGA GCA 348
Phe Asp Pro Gly Ala Pro Pro Pro Phe Asn Leu Ala Asp Ile Arg Ala
85 90 95

GGC ATA CCT AAG CAT TGT TGG GTT AAG AAT CCA TGG ATG TCT ATG AGT 396
Ala Ile Pro Lys His Cys Trp Val Lys Asn Pro Trp Met Ser Met Ser
100 105 110

TAT GTT GTG AGA GAT GTT GCT ATG GTG TTT GGA TTG GGT GGT GTT GGT 444
Tyr Val Val Arg Asp Val Ala Ile Val Phe Gly Leu Ala Ala Val Ala
115 120 125

GCT TAG TTG AAC AAT TGG CTT CTC TGG CCT CTC TAC TGG TTC GCT GAA 492
Ala Tyr Phe Asn Asn Trp Leu Leu Trp Pro Leu Tyr Trp Phe Ala Gln
130 135 140

GGA ACC ATG TTC TGG GCT CTC TTT GTG CTT GGC CAT GAG TGG GGA CAT 540
Gly Thr Met Phe Trp Ala Leu Phe Val Leu Gly His Asp Cys Gly His
145 150 155 160

GGT AGC TTC TCG AAT GAT CCG /«X; CTG AAC AGT GTG GCT GGT CAT CTT 588
Gly Ser Phe Ser Asn Asp Pro Arg Leu Asn Ser Val Ala Gly His Leu
165 170 175

CTT CAT TCC TCA ATT CTG GTC CCT TAC CAT GGC TGG AGG ATT AGC CAC 636
Leu His Ser Ser Ile Leu Val Pro Tyr His Gly Trp Arg Ile Ser His
180 185 190

AGA ACT CAC CAC CAG AAC CAT GGT CAT GTC GAG AAT GAC GAA TCA TGG 684
Arg Thr His His Gln Asn His Gly His Val Glu Asn Asp Glu Ser Trp
195 200 205

CAT CCT TTG CCT GAA AGC ATC TAC AAG AAT TTG GAA AAG ACG ACT CAA 732
His Pro Leu Pro Glu Ser Ile Tyr Lys Asn Leu Glu Lys Thr Thr Gln
210 215 220

ATG TTT AGG TTT ACA CTG CCT TTT CCA ATG CTC GCA TAC CCT TTC TAC 780
Met Phe Arg Phe Thr Leu Pro Phe Pro Met Leu Ala Tyr Pro Phe Tyr
225 230 235 240

TTG TGG AAC AGA AGT CCA GGG AAA CAA GGT TCT CAT TAT CAT CCG GAC 828
Leu Trp Asn Arg Ser Pro Gly Lys Gln Gly Ser His Tyr His Pro Asp
245 250 255

AGT GAC TTG TTT CTT CCA AAA GAG AAG AAA GAT GTT CTG ACA TCA ACT 876
Ser Asp Leu Phe Leu Pro Lys Glu Lys Lys Asp Val Leu Thr Ser Thr
260 265 270

GCC TGT TGG ACT GCA ATG GCT GCT TTG CTT GTT TGT CTC AAC TTT GTC 924
Ala Cys Trp Thr Ala Met Ala Ala Leu Leu Val Cys Leu Asn Phe Val
275 280 285

ATG GGT CCA ATC CAG ATG CTC AAA CTA TAT GGC ATC CCT TAT TGG ATA 972
Met Gly Pro Ile Gln Met Leu Lys Leu Tyr Gly Ile Pro Tyr Trp Ile
290 295 300

TTT GTA ATG TGG TTG GAC TTC GTC ACT TAC TTG CAC CAC CAT GGA CAT 1020
Phe Val Met Trp Leu Asp Phe Val Thr Tyr Leu His His His Gly His
305 310 315 320



FIG. 12b 

GAA GAC AAG CTC CCT TGC TAT CGT GGA AAG GAA TGG AGT TAG CTG AGA 1068
Glu Asp Lys Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg
325 330 335

GGA GGG CTC ACA ACA TTA GAT CGT GAC TAC GGA TGG ATC AAT AAC ATC 1116
Gly Gly Leu Thr Thr Leu Asp Arg Asp Tyr Gly Trp Ile Asn Asn Ile
340 345 350

CAC CAC GAT ATT GGA ACT CAT GTG ATA CAT CAT CTT TTC CCG CAG ATC 1164
His His Asp Ile Gly Thr His Val Ile His His Leu Phe Pro Gln Ile
355 360 365

CCA CAT TAT CAT CTA GTA GAA CCA ACA GAA GCA GCT AAA CCA GTA CTA 1212
Pro His Tyr His Leu Val Glu Ala Thr Glu Ala Ala Lys Pro Val Leu
370 375 380

GGA AAG TAC TAC AGA GAA CCG AAA AAC TCT GGA CCT CTG CCA CTT CAC 1260
Gly Lys Tyr Tyr Arg Glu Pro Lys Asn Ser Gly Pro Leu Pro Leu His
385 390 395 400

TTA CTG GGA AGC CTC ATA AAG AGT ATG AAA CAA GAC CAT TTC GTA AGC 1308
Leu Leu Gly Ser Leu Ile Lys Ser Met Lys Gln Asp His Phe Val Ser
405 410 415

GAT ACA GGA GAT GTC GTG TAC TAT GAG GCA GAT CCA AAA CTC AAT GGA 1356
Asp Thr Gly Asp Val Val Tyr Tyr Glu Ala Asp Pro Lys Leu Asn Gly
420 425 430

CAA AGA ACA TGAGGACATA CTGCAGTGAA CCAGGCAGAC AACTTACATA 1405
Gln Arg Thr
435

AATTCATCTT GGCCCATTCA TTATGTTCTT TTTGTTTTGG TGTAAAGCCT TTTCGAGATT 1465
AAAAAAGCAT TAATTTGTAG AAAGCTGTGG TAAAACTCTC GATCAAATGA AATAAGATAT 1525

Met Ala Ser Ser Val Leu Ser Glu Cys Gly Phe Arg Pro Leu Pro Arg
1 5 10 15

Phe Tyr Pro Lys His Thr Thr Ser Phe Ala Ser Asn Pro Lys Pro Thr
20 25 30

Phe Lys Phe Asn Pro Pro Leu Lys Pro Pro Ser Ser Leu Leu Asn Ser
35 40 45

Arg Tyr Gly Phe Tyr Ser Lys Thr Arg Asn Trp Ala Leu Asn Val Ala
50 55 60

Thr Pro Leu Thr Thr Leu Gln Ser Pro Ser Glu Glu Asp Thr Glu Arg
65 70 75 80

Phe Asp Pro Gly Ala Pro Pro Pro Phe Asn Leu Ala Asp Ile Arg Ala
85 90 95

Ala Ile Pro Lys His Cys Trp Val Lys Asn Pro Trp Met Ser Met Ser
100 105 110

Tyr Val Val Arg Asp Val Ala Ile Val Phe Gly Leu Ala Ala Val Ala
115 120 125

Ala Tyr Phe Asn Asn Trp Leu Leu Trp Pro Leu Tyr Trp Phe Ala Gln
130 135 140

Gly Thr Met Phe Trp Ala Leu Phe Val Leu Gly His Asp Cys Gly His
145 150 155 160

Gly Ser Phe Ser Asn Asp Pro Arg Leu Asn Ser Val Ala Gly His Leu
165 170 175

Leu His Ser Ser Ile Leu Val Pro Tyr His Gly Trp Arg Ile Ser His
180 185 190

Arg Thr His His Gln Asn His Gly His Val Glu Asn Asp Glu Ser Trp
195 200 205

His Pro Leu Pro Glu Ser Ile Tyr Lys Asn Leu Glu Lys Thr Thr Gln
210 215 220

Met Phe Arg Phe Thr Leu Pro Phe Pro Met Leu Ala Tyr Pro Phe Tyr
225 230 235 240

Leu Trp Asn Arg Ser Pro Gly Lys Gln Gly Ser His Tyr His Pro Asp
245 250 255

Ser Asp Leu Phe Leu Pro Lys Glu Lys Lys Asp Val Leu Thr Ser Thr
260 265 270

Ala Cys Trp Thr Ala Met Ala Ala Leu Leu Val Cys Leu Asn Phe Val
275 280 285

Met Gly Pro Ile Gln Met Leu Lys Leu Tyr Gly Ile Pro Tyr Trp Ile
290 295 300

Phe Val Met Trp Leu Asp Phe Val Thr Tyr Leu His His His Gly His
305 310 315 320

Glu Asp Lys Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg
325 330 335

Gly Gly Leu Thr Thr Leu Asp Arg Asp Tyr Gly Trp Ile Asn Asn Ile
340 345 350

His His Asp Ile Gly Thr His Val Ile His His Leu Phe Pro Gln Ile
355 360 365

Pro His Tyr His Leu Val Glu Ala Thr Glu Ala Ala Lys Pro Val Leu
370 375 380

Gly Lys Tyr Tyr Arg Glu Pro Lys Asn Ser Gly Pro Leu Pro Leu His
385 390 395 400

Leu Leu Gly Ser Leu Ile Lys Ser Met Lys Gln Asp His Phe Val Ser
405 410 415

Asp Thr Gly Asp Val Val Tyr Tyr Glu Ala Asp Pro Lys Leu Asn Gly
420 425 430

Gln Arg Thr
435